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(54) TOle: PROTEIN MICROARRAY SYSTEM 

(57) Abstract: The present invention relates to automated methods, systems, and apparatuses protein separation and analysis. In 
particular, the present invention provides an automated system for the separation, identification, and characterization of protein sam- 
ples. The present invention thus provides improved methods for the separation and analysis of samples containing a plurality of 
proteins (e.g., cells). 
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PROTEIN MICROARRAY SYSTEM 

This Application claims priority to U.S. Provisional Patent Application 
60/439,625, filed January 1 3, 2003. 
5 The present invention was made, in part, with government funding under National 

Institutes of Health under grant No. 2-R01GM49500-5 and the National iScience 
Foundation grant No. DBI-9987220. The government has certain rights in this invention. 

FIELD OF THE INVENTION 
10 The present invention relates to automated methods, systems, and apparatuses for 

protein separation and analysis. In particular, the present invention provides an 
automated system for the separation, identification, and characterization of protdn 
samples, including the generation and analysis of protein microarrays. 

15 BACKGROUND OF THE INVENTION 

As the nucleic acid sequences of a number of genomes, including the hxmian 
genome, become available, there is an increasing need to interpret this wealth of 
information. While tlie availability of nucleic acid sequence allows for the prediction and 
identification of genes, it does not explain the expression patterns of the proteins 

20 produced from these genes. The genome does not describe the dynamic processes on the 
protein level. For example, the identity of genes and the level of gene expression does 
not represent the amount of active protein in a cell nor does the gene sequence describe 
post-translational modifications that are essential for the function and activity of proteins. 
Thus, in parallel with the genome projects there has begun an attempt to understand the 

25 proteome (z.e. , the quantitative protein expression pattern of a genome under defined 
conditions) of various cells, tissues, and species. Proteome research seeks to identify 
targets for drug discovery and development and provide information for diagnostics (eg., 
tumor markers). 

An important area of research is the study of the protem content of cells (i.e., the 
30 identity of and amount of expressed proteins in a cell). This field requires methods that 
can separate out large numbers of proteins and can do so quantitatively so that changes in 
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expression or structure of proteins can be detected. The method generally used to 
achieve such cellular protein separations is 2-D PAGE. This method is capable of 
resolvmg hundreds of proteins based upon pi in one dimension and protein size in the 
second dimension. The proteins separated by this method are visualized using a staining 
5 method that can generally be quantified. The result is a 2-dimensional image where the 
protein map is based on pi and approximate molecular weight. By the use of computer 
based image analysis techniques, one can search for proteins that are differentially 
expressed in various cell lines. These methods are used to monitor changes in protein 
expression that are linked to conditions such as cell transformation and cancer 
10 progression, cell aging, the response of cells to environmental insult, and the response of 
cells to pharmaceutical agents. Once changes in protein expression have been identified, 
then one can further analyze target proteins to determine their identity and whether they 
have been altered &om their expected structure by sequence changes or post-translational 
modifications. 

15 Although 2-D PAGE is still widely used for protein analysis, the method has 

several limitations including the fact that it is labor intensive, time consimiing, difficult to 
automate and often not readily reproducible. In addition, quantitation, especially in 
differential expression experiments, is often difficult and limited in dynamic range. Also, 
while the 2-D gel does produce an image of the proteins in the cell, the mass 

20 determination is often only accurate to 5-1 0%, and the method is difficult to interface to 
mass spectrometric techniques for further analysis. 

Another limitation of 2-D PAGE is the amount of protein loaded per gel which is 
generally below 250 \ig. The amount of protein in any given spot may therefore be too 
low for furflier analysis. For Coomassie brilliant blue (CBB) stained gels the limit of 

25 detection is 100 ng per spot while for silver stained gels the limit of detection is 1-10 ng. 
Furthermore, proteins that have been isolated in 2-D gels are embedded inside the gel 
structure and are not fi-ee in solution, thus making it difficult to extract the protein for 
fiirther analysis. Because of these limitations, the art is in need of protein mapping 
methods that are efficient, automated, and have broader resolution capabilities than 

30 presently available technologies. 
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SUMMARY OF THE INVENTION 

The present invention relates to automated methods, systems, and apparatuses for 
5 proteiQ separation and analysis. In particular, the present invention provides an 
automated system for the separation, identification, and characterization of protein 
samples. 

In some embodiments, the present invention provides a system comprising an 
apparatus configured for automated sequential capillary electrophoresis - mass 

10 spectroscopy - mass spectroscopy of at least one protein sample. In some embodiments, 
the at least one protein sample comprises a plurality of polypeptides, and wherein each of 
the at least one sample corresponds to a separated protein fi-action. In some 
embodiments, the system further comprises a software program configured for 
performing the automated sequential capillary electrophoresis - first mass spectroscopy - 

15 second mass spectroscopy of the at least one protein sample. In some embodiments, the 
software is configured for determining mass-to-charge ratio of ions contained in the at 
least one protein sample in real time. In some embodiments, the software is fiirther 
configured to apply a correct fi-equency to an ion trap of the second mass spectroscopy 
apparatus based on the mass to charge ratio. In some embodiments, the system fiirther 

20 comprises a separated protein fraction separated in two dimensions. In some 

embodiments, the system fiarther comprises a separated protein fraction separated in a 
first dimension by isoelectric point and a second dimension by hydrophobicity. In some 
embodiments, the system fiulher comprises a liquid separated protein fraction. In some 
embodiments, the apparatus is configured for automated sample preparation. In some 

25 embodiments, the apparatus fiirther comprises an automatic fraction injector configured 
for the injection of the at least one sample into the ^paratus. In some embodiments, the 
software is configured for automated sample analysis. In other embodiments, the 
software is configured for the generation of a multi-dimensional map corresponding to 
the at least one sample. In some embodiments, the multi-dimension map comprises 

30 infonnation about two or more properties of the at least one sample, the properties 

selected from the group including, but not limited to, protein mw, protein hydrophobicity. 
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protein isoelectric point, protein structure and protein sequence. In some embodiments, 
the first and second mass spectroscopy are ion trap TOF mass spectroscopy. In some 
embodiments, the ion trap TOF mass spectroscopy comprises the step of firagmenting the 
at least one sample prior to performing the ion trap TOF mass spectroscopy. In some 

5 embodiments, the at least one protein sample is enzymatically digested. In some 
embodiments, the system further comprises a plurality of the apparatuses, wherein the 
plurality of apparatuses are configured for the simxiltaneous analysis of a plurality of the 
at least one sample. 

The present invention further provides an apparatus configured for automated 

10 sequential capillary electrophoresis - mass spectroscopy - mass spectroscopy of at least 
one sample, wherein each of the at least one sample comprises a separated protein 
fi-action. In some embodiments, the apparatus further comprises a software program 
configured for performing the automated sequential c^illary electrophoresis - first mass 
spectroscopy - second mass spectroscopy of the at least one sample. In some 

15 embodiments, the software is configured for determining mass-to-charge ratio of ions 
contained in the at least one protein sample in real time. In other embodiments, the 
software is further configured to apply a correct fi-equency to an ion trap of the second 
mass spectroscopy apparatus based on the mass to charge ratio. In some embodiments, 
the separated protein fi-action is separated in two dimensions (e.g., including, but not 

20 limited to, isoelectric point and hydrophobicity). In some embodiments, the separated 
protein firaction is a liquid. In some embodiments, the apparatus is configured for 
automated sample preparation. In some embodiments, the apparatus further comprises an 
automatic firaction injector configured for the injection of the at least one sample into the 
apparatus. In some embodiments, the software is configured for automated sample 

25 analysis. In certain embodiments, the software is configured for the generation of a 

multi-dimensional map corresponding to the at least one sample. In some embodiments, 
the multi-dimension map comprises information about two or more properties of the at 
least on sample, the properties selected from the group including, but not limited to, 
protein mw, protein hydrophobicity, protein isoelectric point, protein stmcture, and 

30 protein sequence. In some embodiments, the first and second mass spectroscopy are ion 
trap TOF mass spectroscopy. In certain embodiments, the ion trap TOF mass 
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spectroscopy comprises the step of fragmenting the at least one sample prior to 
performing the ion trap TOF mass spectroscopy. 

The present invention also provides a method for the automated analysis of 
separated protein samples, comprising providing at least one sample comprising a 

5 plurality of polypeptides, wherein each of the at least one sample corresponds to an 

separated protein fraction; and an analysis apparatus configured for automated sequential 
capillary electrophoresis - mass spectroscopy • mass spectroscopy of the at least one 
sample; and treating the at least one sample with the analysis apparatus to generate 
analyzed sample. In some embodiments, the apparatus further comprises software 

10 configured for performing the automated sequential capillary electrophoresis - first mass 
spectroscopy - second mass spectroscopy of the at least one sample. In certain 
embodunents, the software is configured for determining mass-to-charge ratio of ions 
contained in the at least one protein sample in real time. In fiirther embodiments, the 
software is fiirther configured to apply a correct frequency to an ion trap of the second 

15 mass spectroscopy apparatus based on the mass to charge ratio. In some embodiments, 
the separated protein fraction is separated in two dimensions (eg., including, but not 
limited to, isoelectric point and hydrophobicity). In some embodiments, the separated 
protein fraction is a liquid. In certain preferred embodiments, the apparatus is configured 
for automated sample preparation and the treating fiirther comprises the step of 

20 automated sample preparation. In some embodiments, the automated sample preparation 
comprises automated partial dry down of the at least one sample. In certain 
embodiments, the automated sample preparation fiirther comprises automated enzymatic 
digestion of the at least one sample. In some embodiments, the apparatus fiirther 
comprises an automatic fraction injector configured for the injection of the at least one 

25 sample into the apparatus. In some embodiments, the software is configured for 

automated sample analysis. In some embodiments, the software is configured for the 
generation of a multi-dimensional map corresponding to the at least one sample. In some 
embodiments, the multi-dimension map comprises information about two or more 
properties of the at least on sample, the properties selected from the group including, but 

30 not limited to, protein mw, protein hydrophobicity, protein isoelectric point, protein 
structure, and protein sequence. In some embodiments, the first and second mass 
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spectroscopy are ion trap TOF mass spectroscopy. In certain embodiments, the ion trap 
TOF mass spectroscopy comprises the step of fragmenting the at least one sample prior to 
performing the ion trap TOF mass spectroscopy. In some embodiments, the method 
further provides a plurality of the apparatuses. In some embodiments, the treating 

5 comprising simultaneous analysis of a plurality of the at least one sample with the 
plurality of apparatuses . 

The present invention additionally provides a method for the automated analysis 
of protein samples, comprising providing at least one sample comprising a plurality of 
polypeptides; at least one separation apparatus configured for the separation of proteins 

10 based on a physical property; and an analysis apparatus configured for automated 

sequential capillary electrophoresis - mass spectroscopy - mass spectroscopy of the at 
least one sample; and treating the at least one sample with the separation apparatus to 
generate a plurality of separated polypeptide fractions; and treating at least one of the 
plurality of separated polypeptide fractions with the analysis apparatus to generate an 

15 analyzed polypeptide sample. In some embodiments, the analysis apparatus fiirther 
comprises software configured for performing the automated sequential capillary 
electrophoresis - first mass spectroscopy - second mass spectroscopy of the at least one 
sample. In some embodiments, software is configured for determining mass-to-charge 
ratio of ions contained in the at least one protein sample in real time. In certain 

20 embodiments, the software is further configured to apply a correct frequency to an ion 
trap of the second mass spectroscopy apparatus based on the mass to charge ratio. In 
some embodiments, the at least one separation apparatus comprises two separation 
apparatuses configured to separate the sample in two dimensions including, but not 
limited to, isoelectric point and hydrophobicity). In some embodiments, the separated 

25 polypeptide fraction is a liquid. In some embodiments, the analysis apparatus is 

configured for automated sample preparation and the treating fiirther comprises the step 
of automated sample preparation. In some embodiments, the automated sample 
preparation comprises automated partial dry down of the separated sample, hi fiirther 
embodiments, the automated sample preparation fiirther comprises automated enzymatic 

30 digestion of the separated sample. In some embodiments, the analysis apparatus fiirther 
comprises an automatic fiaction injector configured for the injection of the separated 
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protein sample into the analysis apparatus. In some embodiments, the software is 
configured for automated sample analysis. In certain embodiments, the software is 
configured for the generation of a multi-dimensional map corresponding to the analyzed 
sample. In some embodiments, the multi-dimension map comprises mformation about 

5 two or more properties of the at least on sample, the properties selected fi"om the group 
including, but not limited to, protein mw, protein hydrophobicity, protein isoelectric 
point, protein structure, and protein sequence. In some embodiments, the first and second 
mass spectroscopy are ion trap TOF mass spectroscopy. In some embodiments, the ion 
trap TOF mass spectroscopy comprises the step of fragmenting the at least one sample 

10 prior to performing the ion trap TOF mass spectroscopy. In some embodiments, the 
method further provides a plurahty of the analysis apparatuses. In some embodiments, 
the treating comprises simultaneous analysis of a plurality of the at least one sample with 
said plurality of analysis apparatuses. 

The present invention additionally provides a method for generating protein 

15 microarrays. In certain embodiments, a method for generating microarrays involves 
providing at least one sample comprising polypeptides. In addition, a first protein 
separation apparatus is provided. In preferred embodiments, the first protein separation 
apparatus is a chromatofocusing apparatus. In addition, a second protein separation 
^paratus is provided (e,g,, HPLC). In addition, a solid surface is provided. 

20 In certain embodiments, the sample comprising polypeptides is treated with the 

first protein separation apparatus to generate a first separated polypeptide preparation. 
Next, in preferred embodiments, the first separated polypeptide preparation is treated 
with the second protein separation apparatus to generate a second separated polypeptide 
preparation. In some embodiments, at least a portion of the second separated polypeptide 

25 preparation is spotted onto a soHd surface, thereby generating a protein microarray. In 
some preferred embodunents, the solid surface comprises a nitrocellulose slide. 

In additional embodiments, the protein microarray is exposed to sera or other 
biological fluids. In preferred embodiments the sera sample is taken from a subject 
having a diseases (e.g., to be compared to a control sample from non-diseased subject). 

30 In some preferred embodiments, the sera sample exposed to the protein microarray is 



7 



wo 2004/063719 



PCTAJS2004/000774 



analyzed to detect antibody binding (e.g., to identify antftodies that differentially bind 
from the experimental sample to a different degree than those from a control sample). 

DESCRIPTION OF THE FIGURES 

5 Figure 1 shows an example 2-D protein display using Isoelectric Focusing Non- 

Porous Reverse Phase HPLC (EF-NP RP HPLC) separation of human erythroleukemia 
cell lysate proteins in one embodiment of the present invention. 

Figure 2 shows a zoom area of a portion of the display in Figure 1 (pi = 4.2 to 7.2 
and tR = 6.0 to 9.0) (right panel showing banding patterns) and a corresponding example 
10 of linked HPLC data (left panel showing peaks). 

Figure 3 shows a quantification of rotofor fractions in one embodiment of the 
present invention. 

Figure 4 shows NP RP HPLC separation from a Rotofor fraction of HEL cell 
lysate in one embodiment of the present invention. 
15 Figure 5A and 5B show short (5A) and long (5B) NP RP HPLC separation 

gradient times for a rotofor fraction of HEL cell lysate in one embodiment of the present 
invention. 

Figure 6 shows an example of Copmassie blue stained 2-D PAGE separation of 
HEL cell lysate proteins. 
20 Figure 7 shows a direct side-by-side comparison of lEF-NP RP HPLC (four lanes 

on the left) with 1-D SDS PAGE (four lane on the right) for several Rotofor fractions in 
certain embodiments of the present invention. 

Figures 8A and 8B show MALDI-TOF MS tryptic peptide mass maps for - 
enolase isolated by lEF-NP RP HPLC (8 A) and by 2-D PAGE (8B). 
25 Figure 9 shows a 2D protein image of Isoelectric Focusing - Non-porous RP 

HPLC - ESI oa TOF/MS (lEF-NPS RP HPLC-ESI pa TOF/MS) separation of human 
erythroleukemia cell lysate proteins. 

Figure 10 shows a zoom of the 2D protein image from Figure 9 of 35 kDa to 52 
kDa mass range. 

30 Figure 11 A and 1 IB show actin multiply charged umbrella with MaxEnt 

deconvoluted molecular weight mass spectrum. The umbrella for beta and gamma actin 
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is shown in Figure 1 1 A, each foim of actin being labeled with the charge state. Figure 
1 IB shows the resulting molecular weight mass spectrum for actin where the two fomis 
of actin are separated. 

Figure 12 shows combined protein molecular weight mass spectrum jfrom a 
5 Rotofor fraction shown in traditional peak format. 

Figure 13 shows a zoom of 2D protein image from Figure 9 of 5 kDa to 40 kDa 
mass range. 

Figure 14 shows a chromatofocusing profile of MCF-lOA whole cell lysate. 
Figures 15 A, B, and C show NP-RP-HPCI^ESI-oaTOF TIC (total ion count) 
10 profile of three sample flections identified in Figure 14. 

Figure 16 shows an integrated and deconvoluted TIC profile of the three sample 
fractions from Figure 15, as generated with MaxEntl software. 

Figure 17 shows the anion exchange profile of Siberian Permafrost whole cell 
lysate of sample 23-9-25. 
15 Figures 1 8A and 1 8B show the NP-RP-HPLC-ESI-oaTOF TIC profile of two 

fractions from Figure 17. 

Figure 19 shows an overview of the automated protein analysis system utilized in 
some embodiments of the present invention. 

Figure 20 shows a flow chart of the automated protein separation and analysis 
20 methods of the present invention. 

Figure 21 shows an overview of the separation methods used in one embodiment 
of the present invention. 

Figure 22 shows the multiple ionic capillary coating procedure utilized in some 
embodiments of the present invention. 
25 Figure 23 shows a schematic description of CE-MS instrumentation used in some 

embodiments of the present invention. 

Figure 24 shows a rq)resentative chromatogram of an ipHPLC run. 
Figure 25 shows a representative 3-D profile of a CE-MS run. 
Figure 26 shows a CE-MS elution profile of a tryptic digest. 
30 Figure 27 shows a MS/MS spectrum of a tryptic digest of heat shock protein. 
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Figure 28 shows a comparison between theoretical and experimental oVs and MW 
of identified proteins. 

Figure 29 shows a comparison of coverage between different MS methods. 

Figure 30 shows a flow chart of the flow of information in some embodiments of 
5 the present invention. 

Figure 31 shows a 2-D map of the Uquid separations of the LnCAP cells using 
chromatofocusing over a pH range of 4-7 followed by separation in the second dimension 
using MPS RPHPLC. 

Figure 32 shows the MPS RP HPLC separation of CF fraction #5 m Example 11 
10 at pH 6.0-6.2. 

Figure 33 shows a concentration profile as a function as a function of pi for the 
chromatofocusing of the LnCap cell line from pH 7.0 to pH 4.0. 

Figure 34 shows the detection of humoral response in serum from normal and 
prostate cancer patients. 
15 Figure 35 shows detection of humoral response in prostate cancer agamst fraction 

B6. 

Figure 36 shows detection of humoral response in prostate cancer against Creatine 
Kinase fractionated at pH 6-6.2. 

Figure 37 shows detection of humoral response in prostate cancer against 54 kDa 
20 nuclear RNA-DNA binding protein fractionated at pH 6-6.2. 

GENERAL DESCRIPTION OF THE INVENTION 

The present invention relates to protein separation and analysis methods capable 
of resolving large numbers of cellular proteins, including methods for efficiently 
25 facilitating the transfer of protein samples between separation phases. In some 

embodiments, the present invention provides automated methods for tiie separation and 
characterization of proteins using capillary electrophoresis and tandem ion trap mass 
spectroscopy. 

The methods of the present invention further provide protein profile maps for 
30 imaging and comparing protein expression patterns. The present invention provides 

alternatives to traditional 2-D gel separation methods for ttie screening of protein profiles. 
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Many limitations of traditional 2-D PAGE arise from its use of the gel as the sq)aration 

media. The present invention provides alternative media for the separation that offer 

agnificant advantages over 2-D PAGE techniques. For example, in some embodiments, 

the present invaition provides methods that use two dimensional separations, whwe the 

5 second dimensional separation occurs in the liquid phase, rather than 2-D PAGE 

techniques where the final separation occurs in gel. 

The present invention provides systems and methods for protein separation and 

mapping that are highly efficient, amenable to automation, and provide detailed 

resolution. For example, in some methods of the present invention, proteins are separated. 

10 according to their pi, using isoelectric focusing (lEF) (e.g., in the Rotofor); according to 

their hydrophobicity using non-porous reverse phase HPLC (NFS RP HPLC); and 

according to mass using ESI oa TOF/MS or other mass spectrometry techniques. The 

present invention further provides novel techniques for eluting proteins from a separation 

apparatus (e.g., the first phase separation apparatus). For example, in one embodiment of 

1 5 the present invention, the proteins eluted from the first dimension are peeled off from 

the column according to their pH, either one pH unit or fraction thereof, at a time. In 

some embodiments, these focused Uquid firactions are then separated according to their 

hydrophobicity and size (or other desired properties) in the second dimension. Liquid 

fractions fltmi, for example. NP-RP-HPLC can be conveniently analyzed directly on-Une 

20 using mass spectrometry (eg., BSI-oaTOF) to obtain their molecular weight and relative 

abundance, which provides a tod dimension. As a result, a virtual 2-D protein image is 

created and is analogous to a 2-D gel image. 

Experiments conducted during the development of the present invention have 

demonstrated that these methods are capable of separating large numbers of proteins. 

25 The 2-D image of these proteins, analogous to that of a 2-D gel, can be generated for the 
purpose of observing distinctive patterns from a particular cell line. This protein pattern 
provides relative quantitative information, high mass resolution and high accuracy pi and 
mass values. Given that the intensity, mass and pi values are reproducible, one can study 
differential expression of proteins where the resulting 2-D images from different cells, 

30 tissues, or samples can be quantitatively compared to identify points of interest. 
Furthermore, automation and speed of analysis are peatly facilitated given that the 
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proteins remain in the liquid phase throughout the separation. The method, abbreviated 
lEF-NPS RP HPLC-ESI oa TOF/MS is shown to be a viable alternative for the separation 
of complex protein mixtures and the generation of high-resolution 2-D images of cellular 
protein expression. 

5 In some embodiments of the present invention, proteins are separated in a first 

dimension using any of a large number of protein separation techniques including, but not 
limited to, ion exclusion, ion exchange, noraial/reversed phase partition, size exclusion, 
ligand exchange, liquid/gel phase isoelectric focusing, and adsorption chromatography. 
In some preferred embodiments of the present invention, the first dimension is a liquid 

1 0 phase separation method. The sample from the first separation is passed through a 
second dimension separation. In prefeired embodiments of the present invention, the 
second dimension separation is conducted in liquid phase. The products from the second 
dimension separation are then characterized. For example, in preferred embodiments, the 
products of the second separation step are detected and displayed in a2-D fomiat based 

15 on the physical properties of the proteins that were distinguished in the first and second 
separation steps (eg., under conditions such that the first and the second physical 
properties are revealed for at least a portion of the proteins). The products may be fiirtha: 
analyzed, for example, by mass spectrometry to determine the mass and/or identity of the 
products or a subset of the products. In these embodiments, a three dimensional 

20 characterization can be appUed (z.e., based on the physical properties of the first two 
separation steps and the mass spectrometry data). It is contemplated that other protein 
processing steps can be conducted at any stage of the process. 

In certain embodiments of the present invention, the steps are combined in an 
automated system. In preferred embodiments, each of the steps is automated. For 

25 example, the present invention provides a system that includes each of the separation and 
detection elements in operable combination so that a protein sample is applied to the 
system and the user receives expression map displays or other desired data output. To 
achieve automation, in preferred embodiments, the products of each step should be 
compatible with the subsequent step or steps. 

30 In one illustrative embodiment of the present invention protems are separated 

according to their pl, using isoelectric focusing QEF) in a Rotofor and according to their 
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hydrophobicity and molecular weight using NP RP HPLC. This combined separation 
method is abbreviated EF-NP RP HPLC. When coupled with mass spectrometry (MS) 
this technique becomes three-dimensional and allows for the creation of a protein map 
that tells the pi and the molecular weight of the proteins in question. This information 

5 can be plotted in an image that also depicts protein abundance. The end result is a high- 
resolution image showing a complex pattern of proteins separated by pi and molecular 
weight and indicating relative protein abundances. This image can be used to determine 
how the proteins in a given cell line or tissue may change due to some disease state, 
pharmaceutical treatment, natural or induced differentiation, or change in environmental 

10 conditions. The image allows the observer to determine changes in pi, molecular weight, 
and abundance of any protein in the image. . When interfaced to MS the identity of any 
target protein may also be obtained via enzymatic digests and peptide mass map analyses. 
In addition, this technique has the advantage of very high loadabiUty 1 gram) such 
that the lower abxmdance proteins may be detected. 

15 In traditional 2-D PAGE separation and display techniques, the second phase 

separation is conducted in a gel (i.e., not a liquid phase) and the proteins are separated 
and detected by differences in molecular weight. In contrast, in some embodiments of 
the present invention, the second phase separation is conducted in Uquid phase. The 
products of the second phase separation techniques of the present invention are much 

20 more amenable to further characterization and to interpretation of data produced from the 
second phase. For example, in some embodiments of the present invention, the second 
phase is conducted using HPLC where the separated protein products are readily detected 
as peak fractions and interpreted and displayed in two dimensions by a computer based 
on the physical properties of the first and second separation steps. The products of HPLC 

25 separation, being in the liquid phase, are readily used in further detection steps (eg., mass 
spectrometry). The methods of the present invention, as compared to traditional 2-D 
PAGE, allow more sample to be analyzed, are more efficient, facilitate automation, and 
allow for the analysis of proteins that are not detectable with 2-D PAGE. 

For example, in one illustrative embodiment of the present invention, the protein 

30 profile of human erythroleukemia (EEL) cells has been analyzed using the methods of 
the present invention as well as traditional gel based methods for comparison purposes. 
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Two-dimensional images were generated representing each of the separation methods 
used. Proteins were separated and then collected using both the EEF-NP RP HPLC of the 
present invention and 2-D PAGE methods. These proteins were then enzymatically 
digested and the peptide mass maps were determined by MALDI-TOF MS (if a protein 

5 cannot be unambiguously identified by this method, further analysis is made by any 
number of techniques including, but not limited to, LC/MS-MS, PSD-MALDI, NMR, 
Western blotting, Edman sequence analysis and mass spectrometry can help with further 
analysis of proteins [See e.g., Yates, J. Mass Spec, 33:1 (1998); Chen et iz/.. Rap. Comm. 
Mass Spec, 13:1907 (1999); Neubauer and Mann, Anal. Chem. 71:235 (1999); Zugaro et 

10 al. Electrophoresis 19:867 (1998); hnmler et al. Electrophoresis 19:1015 (1998); Reid et 
ah, Electrophoresis 19:946 (1998); Rosenfeld, et al. Anal. Biochem., 203:173 (1992); 
Matsui et al. Electrophoresis 18:409 (1997); Patterson and Aebersold, Electrophoresis 
16:1791 (1995)]). 

In some embodiments, the proteins were tentatively identified using MS-Fit to 
1 5 search the peptide mass maps against the Swiss and NCBIiu- protein databases. This 
work demonstrated that a large number of proteins, with a useful mass range, were 
separated using the methods of the present invention and that a 2-D image of these 
proteins was reproducibly generated for the purpose of observing distinctive patterns that 
are associated with a particular cell line. The methods of the present invention allowed 
20 for the detection of proteins not observed with the 2-D PAGE technique. Automation 
and speed of analysis are also greatly facilitated given that the proteins remain in the 
liquid phase throughout the separation. 

In some embodiments, the present invention provides an automated protein 
• separation and characterization system. The system is fully integrated and transfers and 
25 coordinates multi-phase, orthogonal separation methods. In some embodiments, the 
infonnation is transferred by the automated system to software for the generation of 
multi-dimensional protein maps. Automation provides increased speed, efficiency, and 
sample recovery while eliminating potential sources of contamination and sample loss. 
Thus, the methods of the present invention are shown to be an advantageous 
30 technique for the generation of images of protein expression profiles as well as for the 
collection ofindividual proteins for further analyses. These capabilities allow one to 
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momtor changes in protein expression that are linked to differentiation pathways as well 
as particular conditions such as cancer {See e.g., Hanash, Advances in Electrophoresis; 
Chrambach, A., Editor, pp 1-44 [1998]), cell aging {See e.g., Steller, Science 267:1445 
[1995]), the response of cells to environmental insuU {See e.g., Welsh et al, Biol. 
5 Reprod, 55:141 [1996]), or the response of cells to some pharmaceutical agent. Having 
identified significant changes in protein expression, one can then further analyze proteins 
of interest to determine their identity and whether they have been altered from their 
expected structure by sequence changes or post-translational modifications. 

10 Definitions 

To facilitate an understanding of the present invention, a number of terms and 
phrases are defined below: 

As used herein, the term "separated protein fraction" refers to one fraction of a 
sample separated in one or more dimensions (e.g., a fraction from a chromatography 
15 separation). 

As used herein, the term "microarray" refers to a soUd surface comprising a 
plurality of addressed biological macromolecules {e.g., proteins, polypeptides, nucleic 
acids, antibodies, etc.). In some preferred embodiments, the location of each of the 
macromolecules in the microarray is known, so as to allow for identification of the 
20 samples following analysis. 

As used herein, the term "array of target molecules" or "array of target molecule 
binding targets" refers to a microarray of target molecules that are known to, or are 
suspected of, associating with a "cellular product." 

As used herein, the terms "solid support" or "support" refer to any material that 
25 provides a solid or semi-solid structure with which another material can be attached. 
Such materials include smooth supports (e.g., metal, glass, plastic, silicon, and ceramic 
surfaces) as well as textured and porous materials. Such materials also include, but are 
not limited to, gels, rubbers, polymers, and other non-rigid materials. Solid supports need 
not be flat. Supports include any type of shape including spherical shapes (e.g., beads). 
30 Materials attached to solid support may be attached to any portion of the solid support 
(e.g., may be attached to an interior portion of a porous sohd support material). Preferred 

15 
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embodiments of the present invention have biological molecules such as proteins 
attached to solid supports. A biological material is "attached" to a solid support when it 
is associated with the solid support through a non-random chemical or physical 
interaction. In some preferred embodiments, the attachment is through a covalent bond. 

5 However, attaclunents need not be covalent or permanent. In some embodiments, 

materials are attached to a solid support through a "spacer molecule" or "Unking group." 
Such spacer molecules are molecules that have a first portion that attaches to the 
biological material and a second portion that attaches to the solid support. Thus, when 
attached to the solid support, the spacer molecule separates the solid support and the 

10 biological materials, butis attached to both. 

As used herein, the term "an apparatus configured for automated sequential 
capillary electrophoresis - mass spectroscopy - mass spectroscopy of said at least one 
sample" refers to an apparatus configured for automated sequential analysis by capillary 
electrophoresis, mass spectroscopy, and a second mass spectroscopy step without user 

15 intervention. In some preferred embodiments, the apparatus performs automated sample 
preparation, injection, and analysis. In some embodiments, the apparatus includes 
robotics for automated sample handling. In some embodiments, the apparatus includes 
software and a computer system for directing the operation of the apparatus , "a 
software program configured for performing said automated sequential capillary 

20 electrophoresis - mass spectroscopy - mass spectroscopy of said at least one sample"). 

As used herein, the term "automated sample preparation" refers to the preparation 
of a sample for analysis using the apparatus of the present invention in the absence of 
operator intervention. In some embodiments, automated sample preparation is performed 
by robotics. In some embodiments, software and a computer system control the 

25 automated sample preparation. Automated sample preparation includes all steps 

necessary to prepare a sample for analysis including, but not limited to, partial dry down 
and enzymatic digestion. 

* As used herein, the term "automatic fi-action injector configured for the injection 
of said at least one sample into said apparatus" refers to an apparatus configured for the 

30 injection of sample into a capillary electrophoresis - mass spectroscopy - mass 

spectroscopy apparatus in the absence of operator intervention. In some embodiments, 
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the automated fraction injector utilizes robotics controlled by software and a computer 
syst^. 

As used herein, the term "CE-MS/MS analyzed sample" refers to a sample that 
has been treated by the capillary electrophoresis - mass spectroscopy - mass spectroscopy 

5 apparatus of the present invention. In preferred embodiments, the analyzed sample 

comprises information relating to a polypeptides contained in the sample (e.g., including, 
but not limdted to, protem MW, identity, and structure). 

As used herein, the term "multiphase protein separation" refers to protein 
separation comprising at least two separation steps. In some embodiments, multiphase 

10 protein separation refers to two or more separation steps that separate proteins based on 
different physical properties of the protein (e,g,, a jSrst step that separates based on 
protein charge and a second step that separates based on protein hydrophobicity). 

As used herein, the term "protein profile m^s" refers to representations of the 
protein content of a sample. For example, "protein profile map" includes 2-dimensional 

1 5 displays of total protein expressed in a given cell. In some embodiments, protein profile 
maps may also display subsets of total protein in a cell. Protein profile maps may be used 
for comparing "protein expression patterns" (e.g., the amount and identity of proteins 
expressed in a sample) between two or more samples. Such comparing find use, for 
example, in identifying proteins that are present in one sample (e.g., a cancer cell) and 

20 not in another (e.g., noimal tissue), or are over- or under-expressed in one sample 
compared to the other. 

As used herein, the term "separating apparatus capable of separating proteins 
based on a physical property" refers to compositions or systems capable of separating 
proteins (e.g., at least one protem) fi-om one another based on differences in a physical 

25 propCTty between proteins present in a sample containing two or more protein species. 
For example, a variety of protein separation columns and composition are contemplated 
including, but not limited to ion exclusion, ion exchange, normal/reversed phase 
partition, size exclusion, ligand exchange, liquid/gel phase isoelectric focusing, and 
adsorption chromatography. These and other apparatuses are capable of separating 

30 proteins fi-om one another based on their size, charge, hydrophobicity, and ligand binding 
affinity, among other properties. A "Uquid phase" separating apparatus is a separating 
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apparatus that utilizes protein samples contained in liquid solution, wherein proteins 
remain solubilized in liquid phase during separation and wherein the product (e.g., 
fractions) collected from the apparatus are in the liquid phase. This is in contrast to gel 
electrophoresis apparatuses, wherein the proteins enter into a gel phase during separation. 
5 Liquid phase proteins are much more amenable to recovery/extraction of proteins as 
compared to gel phase. In some embodiments, liquid phase proteins samples may be 
used in multi-step (e.g,, multiple separation and characterization steps) processes without 
the need to alter the sample prior to treatment in each subsequent step (e.g., without the 
need for recovery/extraction and resolubilization of proteins). 

10 As used herein, the term "displaying proteins" refers to a variety of techniques 

used to interpret the presence of proteins within a protein sample. Displaying includes, 
but is not limited to, visualizing proteins on a computer display representation, diagram, 
autoradiographic film, list, table, chart, etc. "Displaying proteins under conditions that 
first and second physical properties are revealed" refers to displaying proteins (e.g., 

15 proteins, or a subset of proteins obtained from a separating apparatus) such that at least 
two different physical properties of each displayed protein are revealed or detectable. For 
example, such displays include, but are not limited to, tables including columns 
describing (e.g., quantitating) the first and second physical property of each protein and 
two-dimensional displays where each protein is represented by an X,Y locations where 

20 the X and Y coordinates are defined by the first and second physical properties, 

respectively, or vice versa. Such displays also include multi-dimensional displays (^.g., 
three dimensional displays) that include additional physical properties. 

As used herein, "characterizing protein samples under conditions such that first 
and second physical properties are analyzed" refers to the characterization of two or more 

25 proteins, wherein two different physical properties are assigned to each analyzed (e.g,, 
displayed, computed, etc.) protein and wherein a result of the characterization is the 
categorization (i.e., grouping and/or distinguishing) of the proteins based on these two 
different physical properties. For example, in some embodiments, two proteins are 
separated based on isoelectric point and hydrophobicity. 

30 As used herein, the teraa "comparing first and second physical properties of 

separated protein samples" refers to the comparison of two or more protein samples (or 
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individual proteins) based on two different physical properties of the proteins within each 
protein sample. Such comparing includes grouping of proteins in the samples based on 
the two physical properties and comparing certain groups based on just one of the two 
physical properties (i.e., the grouping incorporates a comparison of the other physical 
5 property). 

As used herein, the term "delivery apparatus capable of receiving a separated 
protein from a separating apparatus"' refers to any apparatus {e.g., microtube, trough, 
chamber, etc.) that receives one or more fractions or protein samples from a protein 
separating apparatus and delivers them to another apparatus {e,g., another protein 

1 0 separation apparatus, a reaction chamber, a mass spectrometry apparatus, etc.). 

As used herein, the term "detection system capable of detecting proteins" refers to 
any detection apparatus, assay, or system that detects proteins derived from a protein 
separating apparatus (e.^., proteins in one or fractions collected from a separating 
apparatus). Such detection systems may detect properties of the protein itself (e.g. , UV 

15 spectroscopy) or may detect labels (e.g., fluorescent labels) or other detectable signals 
associated with the protein. The detection system converts the detected criteria (e.g., 
absorbance, fluorescence, luminescence etc.) of the protein into a signal that can be 
processed or stored electronically or through similar means (e.g., detected through the use 
of a photomultipUer tube or similar system). 

20 As used herein, the term "buffer compatible with an apparatus" and "buffer 

compatible with mass spectrometry" refer to buffers that are suitable for use in such 
apparatuses (e.g., protein separation apparatuses) and techniques. A buffer is suitable 
where the reaction that occurs in the presence of the buffer produces a result consistent 
with the intended purpose of the apparatus or method. For example, a buffer compatible 

25 with a protein separation apparatus solubilizes the protein and allows proteins to be 

separated and collected from the apparatus. A buffer compatible with mass spectrometry 
is a buffer that solubiUzes the protein or protein fragment and allows for the detection of 
ions following mass spectrometry. A suitable buffer does not substantially interfere with 
the apparatus or method so as to prevent its intended purpose and result (i.e., some 

30 interference may be allowed). 
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As used herein, the term "automated sample handling device" refers to any device 
capable of transporting a sample (e.g., a separated or un-separated protein sample) 
between components (e.g. , separating ^paratus) of an automated method or system (e.g., 
an automated protein characterization system). An automated sample handling device 

5 may comprise physical means for transportmg sample (e.g., multiple lines of tubing 
connected to a multi-channel valve), hi some einbodiments, an automated sample 
handling device is connected to a centraUzed control network. 

As used herein, the term "switchable multi channel valve" refers to a valve that 
directs the flow of hquid through an automated sample handling device. The valve 

10 preferably has a plurality of channels (e.g., 2 or more, and preferably 4 or more, and more 
preferably, 6 or more). In addition, in some embodiments, flow to individual channels is 
"switched" on an off. In some embodiments, valve switching is controlled by a 
centralized control system. A switchable multi-channel valve allows multiple apparatus 
to be connected to one automated sample handler. For example, sample can first be 

15 directed through one apparatus of a system (e.g., a first chromatography apparatus). The 
sample can then be dhrected through a different channel of the valve to a second 
apparatus (e.g., a second chromatography apparatus). 

As used herein, the terms "centralized control system" or "centralized control 
network" refer to information and equipment management systems (e.g., a computer 

20 processor and computer memory) operably linked to multiple devices or apparatus (e.g., 
automated sample handling devices and separating apparatus). In preferred 
embodiments, the centralized control network is configured to control the operations of 
flie apparatus and/or device linked to the network. For example, in some embodiments, 
the centralized control network controls the operation of multiple chromatography 

25 apparatuses, the transfer of sample between the apparatuses, and the analysis and 
presentation of data. 

As used herem, the terms "computer memory" and "computer memory device" 
refer to any storage media readable by a computer processor. Examples of computer 
• memory include, but are not limited to, RAM, ROM, computer chips, digital video disc 

30 (DVDs), compact discs (CDs), hard disk drives (HDD), and magnetic t^e. 
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As used herein, the term "computer readable medium" refers to any device or 
system for storing and providing information data and instructions) to a computer 
processor. Examples of computer readable media include, but are not limited to, DVDs, 
CDs, hard disk drives, magnetic tape and servers for streaming media over networks. 

As used herein, the terms "processor" and "central processing unit" or "CPU" are 
used interchangeably and refers to a device that is able to read a program from a 
computer memory (e.g., ROM or other computer memory) and perform a set of steps 
according to the program. 

As used herein, the term "directly feeding" a protein sample from one apparatus to 
another apparatus refers to the passage of proteins from the first apparatus to the second 
apparatus without any intervening processing steps. For example, a protein that is 
directly fed from a protein separating apparatus to a mass spectrometry apparatus does 
not undergo any intervening digestion steps (z.e., the protein received by the mass 
spectrometry apparatus is undigested protein). 

As used herein, the term "sample" is used in its broadest sense. Li one sense it 
can refer to a cell lysate. In another sense, it is meant to include a specimen or culture 
obtained from any source, including biological and environmental samples. Biological 
samples maybe obtained from animals (including humans) and encompass fluids, solids, 
tissues, and gases. Biological samples include blood products (e.g., plasma and serum), 
saliva, urine, and the like and includes substances from plants and microorganisms. 
Environmental samples include environmental material such as surface matter, soil, 
water, and industrial samples. These examples are not to be construed as limiting the 
sample types applicable to the present invention. 

DETAILED DESCMPTION OF im INVENTION 

The present invention relates to automated methods, systems, and apparatuses for 
protein separation and analysis, hi particular, the present invention provides an 
automated system for the separation, identification, and characterization of protein 
samples. 

hi some embodiments, the present invention provides systems and apparatuses for 
liquid separation of the protein content of a cellular sample with subsequent structural 
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analysis using capillary electrophoresis tandem time-of-flight mass spectrometry. In 
some embodiments, the liquid separation uses a 2-D dimensional liquid separation of the 
protein content of the cell using chromatofocusing to generate a pi based separation 
followed by nonporous RP HPLC separation to generate an image of the protein content 

5 of the saniple. In some embodiments, the liquid protein are collected in a fraction 
collector and then each sample is digested and analyzed by capillary electrophoresis 
separation with on-line mass spectrometric analysis using a quadrupole ion 
trap/time-of-flight tandem mass spectrometer for identification and sequence analysis. 
The systems of the present invention are easily automated to identify and perform 

10 structural analysis on hundreds of proteins collected in the liquid phase per day. The 
below description provides a non-limiting description of exemplary embodiments of the 
present invention. One skilled in the relevant art recognizes that alternative embodinaents 
fall within the scope of the present invention. 

15 L Automated Protein Analysis System 

In some embodiments, the present invention provides an automated protein 
analysis system. In some embodiments, proteins are first separated using any suitable 
separation method (e.g., including, but not limited to, those disclosed herein). In some 
preferred embodiments, proteins are separated usmg the 2-D liquid separation methods 

20 described herein. For example, in some embodiments, the method to be used in the first 
dimension involves a liquid ph based pi separation (e.g., including, but not limited to, 
chromatofocusing, isoelectric focusing in a Rotofor device or Isoprime-type device, or 
free flow electrophoresis in the liquid phase; See e.g., below description of separating 
methods). In some embodiments, the pi fractions from the first separation are 

25 automatically collected and injected into the second phase for analysis. In some 
embodiments, the second phase uses nonporous reversed phase chromatography to 
separate the proteins from each pi fraction in the second dimension. 

The present invention is not limited to liquid separation. Any separation method 
may be utilized (eg., gel electrophoresis). The present invention is also not limited to 2- 

30 dimension separation. Any separation method that separates samples in one or more 
dimensions may be utilized. 
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In some embodiments, the present invention fiffther provides software to generate 
a 2-D image of the protein content of the ceU. Following separation, samples are 
analyzed using the CE-MS/MS methods described below. 

5 A. CE-MS/MS 

In some preferred embodiments, the present invention provides a capillary 
electrophoresis - Mass Spectroscopy - Mass spectroscopy (CE-MS/MS) apparatus to 
identify and characterize separated proteins in the samples. In some preferred 
embodiments, isolated proteins are collected in the liquid phase directly from the liqiud 

10 eluent of the HPLC or other protein separation apparatus {See e.g. , above description of 
sq)aration methods). For example, in some embodiments, each protein fraction is 
collected by an automatic fraction coUector, partiaDy dried down and enzymatically 
digested. 

The protein digests are then sequentially analyzed by c^Ulary electrophoresis 
15 (CE)-mass spectrometry (e.g.,TOF mass spectroscopy), hi some embodiments, an 
automatic fraction injector is utilized to inject each sample. For example, in some 
embodiments, the capillary is dipped into an enzyme digest (e.g. , by a computer 
controlled robotic arm). The enzyme digest is injected into the capillary by applying a 
loading voltage (e.g., using a computer controlled robotic aim). The capillary is then 
20 removed from the digest vial (e.g., using the robotic aim) and lowered into a buffer vial. 
The CE running voltage is then turned on and separation of ttie digest occurs in 
approximately ten minutes. The peptides are then detected on-line using mass 
spectrometry (e.g.,TOF mass spectroscopy). In some preferred embodiments, an 
ion-trap/TOF MS is used to detect the peptide masses. One advantage of this method is 
25 ftat the intact ions are detected and the MW of the peptide detramined. 

hi some embodiments, as the eluting peak is being detected, the MW is identified 
(feg., by an automated computer system), and the parent ion and fragment ion are isolated 
to perform tandem MS (MS/MS) to obtam structural analysis of the peptide, hi other 
embodiments, manual override during the single injection is used to force fragmentation 
30 at a specific MW when desired. In other embodiments, a second injection with manuaUy 
loaded parameters is utilized. In some embodiments, the automated selection method is 
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applied to the entire peptide digest as it elutes from the CE and a peptide map with 
MS/MS obtained for the protein. The MW, protein digest, and additional data (e.g., pi) is 
then be used to identify the protein against a database with high certainty {See e.g., below 
description of database searching), hi addition, in some embodiments, the MS/MS data is 
5 used to confirm identification and also for detailed structural analysis or to identify the 
presence and position of posttranslational modifications. 

For example, in some embodiments, for the real time MW identification and 
subsequent structural analysis of eluting peptides, the MS is initially configured to collect 
intact ions without fragmentation. Each mass specti^ is analyzed over the entire mass 
10 range for the presence ofions exceeding a threshold. In some embodiments, when a 
computer algorithm determines that a peptide has begun eluting, its MW is determined. 
Fragmentation for structural identification is then performed for the remainmg duration 
of ihe peak's elution. The ions in the eluting peak are confined to an ion trap and a 
waveform consisting of aU frequencies except for the frequency resonance with tiie 
15 desired ion is applied to the cell. This notched waveform excludes all interfereaices from 
the ion ti-ap. Then a low amplitiide frequaicy that is resonant witii the desired ion is 
applied to partially fragment it. When subsequentiy ejected from the ion trap, both parent 
and fragment ion information is obtained free of background. In order to know the 
appropriate frequency to apply to the ion tr^, the mass-to-charge ratio of the desired ion 
20 must be known. In some embodiments, tiie software of tiie present invention is used to 
detennine this in real time as a peak begins to elute. The notched waveform is 
immediately calculated and applied to the ion ti^p so that background-free spectra can be 
acquired over as much of the peak as possible. This capabifity avoids tiie necessity of 
performing two injections with the first done to obtain the desired ion-of-interest 
25 information. This more than doubles the throughput. The amplitude of tiie frequency 
resonant witii tiie desired ion-of-interest is feedback adjusted in real time from spectra to 
spectra to optimize tiie parent to fragment ion ratio. 

The CE-MS/MS apparatus of tiie present invention provides several advantages 
over cuirentiy available systems. For example, ttie ion trap MS/MS system of tiie present 
30 invention is able to rapidly process samples separated by CE. In some prefened 
embodiments, tiie ion tr^ MS systems of die present invention are able to process 
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spectrum at a rate of 2-10 spectra/sec. This is in contrast to currently available scanning 
MS systems (e g., those available from Micromass, and Applied Biosystems, Foster City, 
CA), which are not able to process spectrum fast enough to keep up with CE separations. 
In addition, in some embodiments, the present invention provides an all Uquid, automated 
5 separation and analysis system. This system (See below description of automation and 
software) provides the added benefit of sequential purification and analysis without 
additional user intervention. 

B. . Software 

10 In some embodiments, the present invention further provides software for the 

operation of the automated apparatus of the present invention and the analysis of data. 
Figure 30 shows a flow chart describing the flow of information in one illustrative 
embodiment In some preferred embodiments, the software is in communication with a 
computer processor and the CE-MS/MS apparatus of the present invention. In some 

1 5 preferred embodiments, the software controls all aspects of the analysis and 

characterization. For example, in some embodiments, the software controls robotic arms 
and auto injectors used to prepare samples for CE-MS/MS and inject samples into the 
CE. The software ftirfher controls a high voltage power supply for CE sample loading 
and running, and CE column/electrode mechanical positioning for loading and running 

20 and overall system timing. 

In some embodiments, the software analyzes the parent ion peak and determines 
whether or not to Segment the remaining peak for a second round of MS. The notched 
waveform is immediately calculated and applied to the ion trap so that background-free 
spectra can be acquired over as much of the peak as possible. This capability avoids the 

25 necessity of performing two injections with the first done to obtain the desired ion-of- 
interest information. This more than doubles the throughput. The amplitude of the 
frequency resonant with the desired ion-of-interest is feedback adjusted in real time from 
spectra to spectra to optimize the parent to fragment ion ratio. 

In some embodiments, data from the time of flight mass spectrometer is digitized 

30 and recorded by a computer using the software of the present invention. In some 
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embodiments, the software further provides representation of tbe data as a two 
dimensional image for visual identification of important mass spectral peak locations. 

In still further embodiments, software analyses multi-dimension protein maps 
generated from separation steps and determines which fractions to select for further 

5 embodiments. In yet other embodiments, the software analyzes the CE-MS/MS data, in 
combination with data obtained in separation steps, to determine protein identity (e.g., by 
searching peptide databases) and to display results. In some embodiments, results are 
displayed as a multidimensional protein map with information corresponding to protein 
MW, identity, and other physical properties. In some embodiments, the results are 

10 displayed on a computer display screen. 

C. Automation 

In some embodiments, the present invention provides fvdly automated protein 
separation and characterization. Previously, two mass spectroscopy injections were 

15 required to accomplish stmctural determination. The first was obtained without 

fragmentation in order to acquire the protein MW information. Manual analysis was 
performed to construct a table of expected protein elution times, MW and corresponding 
waveforms. The second injection used that information to decide when to apply the 
specified pre-determined information for fragmentation. The real time on-the-fly method 

20 of the present invention results in time savings of more than a factor of two (only one vs. 
two injections required plus manual analysis time not required) and uses one half the 
sample amount. Furthermore, multiple samples can be sequentially analyzed without 
having to wait for manual analysis of the first injection for each. 

M some embodiments, once a protein digest is analyzed by CE-TOF MS/MS, the 

25 computer places the data in a format that it is imported directly into a database for 

identification and analysis. The computer then instructs the automatic flection injector to 
inject the next sample. The next CE can then proceed while data analysis of the first 
digest is performed by the computer. The 50-100 samples typically isolated from each pi 
fraction in flie liquid phase can be easily processed using the automated methods of the 

30 present invention. For example, at approximately 10 minutes per sample, it takes 
approximately 8-10 hours to analyze the proteins from each pi m detail. In some 
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embodiments, multiple CE-MS/MS systems are linked in tandem to one purification 
system, allowing for the simultaneous analysis of multiple separated firactions. In 
addition, in some embodiments, the computer is programmed to automatically select 
proteins firom the 2-D liquid map for analysis. 

5 

D. The System in Operation 

Figure 19 shows an overview of one illustrative embodiment of the present 
invention. Figure 20 shows a flow chart of the separation and analysis methods of the 
present invention. Example 9 provides an exemplary method used in some embodiments 

10 of die presoit invention. 

In some embodiments, the method of the present invention provides an all liquid 
method for separation, identification and detailed analysis of proteins firom cells. In some 
embodiments, the method utilizes separation of proteins using a 2-D hquid separation 
with subsequent on-line detection using electrospray (ESI)-TOF mass spectrometry. This 

15 method provides a 2-D image of the protein content of the cell according to pi versus 
MW of the intact protein. The map is used as the standard to compare different cell Unes 
for changes in protein expression or structure. The mass map can be used to identify 
proteins that change in MW or quantitative expression for fiirther analysis. 

In some embodhnents, differential display maps are used to compare different cell 

20 lines. The software of the present mvention is used to identify proteins that change using 
the differential display mode, hi some embodiments, the computer then searches for 
these changes and instructs the automatic firaction injector to analyze only these samples 
in detail. 

In other embodiments, the methods of the present invention are utilized for high- 
25 throughput proteome analysis. For example, in some embodiments, an analysis system 
that comprises a plurality of CE-MS/MS apparatuses is utilized for the simultaneous 
analysis of MW, structural characteristics {e.g., post translational modifications), physical 
properties, and identity of proteins in a plurality of separated fractions. Using such 
methods, an entire proteome can be analyzed in the course of a day, with small sample 
30 requirements and litUe or no operator intervention required. The rapid methods of the 
present invention allow for the analysis of an organism or cell's proteome on a recurring 
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basis daily in response to an external stimulus). Such methods are useful, e.g., in 
drug screening applications, to characterize development, and to monitor malignancies. 

11. Two-Phase Separation Techniques 
5 The first dimension separates proteins based on a first physical property. For 

example, in some embodiments of the present invention proteins are separated by pi 
using isoelectric focusing in the first dimension {See Righetti, Laboratory 
Techniques in Biochemistry and Molecular Biology; Work, T. S.; Burdon, R. H., 
Elsevier: Amsterdam, p 10 [1983]). However, the first dimension may employ any 
10 number of separation techniques including, but not limited to, ion exclusion, ion 

exchange, normal/reversed phase partition, size exclusion, ligand exchange, liquid/gel 
phase isoelectric focusing, and adsorption chromatography. In some embodiments {e.g., 
some automated embodiments), it is prefeired that the first dimension be conducted in the 
liquid phase to enable products of the sqparation step to be fed directly mto a second 
1 5 liquid phase separation step. 

The second dimension separates proteins based on a second physical property 
(i.e., a different property than the first physical property) and is preferably conducted in 
the liquid phase (e.g., liquid-phase size exclusion). For example, in some embodiments 
of flie present invention proteins are separated by hydrophobicity using non-porous 
20 reversed phase HPLC in the second dimension {See e.g., Liang et al. Rap. Comm. Mass 
• Spec., 10:1219 [1996]; Griffin et al, Rq). Comm. Mass Spec, 9:1546 [1995]; Opiteck et 
al. Anal. Biochem. 258:344 [1998]; Nilsson a/.. Rap. Comm. Mass Spec, 11:610 
[1997]; Chra et al, R^. Comm. Mass Spec, 12:1994 [1998]; Wall et al. Anal. Chem., 
71:3894 [1999]; Chong et al. Rap, Comm. Mass Spec, 13:1808 [1999]). This method 
25 provides for exceptionally fast and reproducible high-resolution separations of proteins 
according to their hydrophobicity and molecular weight. The non-porous (NP) silica 
packing material used in these reverse phase (RP) separations eliminates problems 
associated witti porosity and low recovery of larger proteins, as well as reducing analysis 
times by as much as one third. Separation efficiency remains high due to the small 
30 diameter of the spherical particles, as does the loadabiUty of the NP RP HPLC columns. 
However, the second dimension may employ any numbfcr of separation techniques. For 
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example, in one embodiment, 1-D SDS PAGE lane gel is used. Having the second 
dimension conducted in the liquid phase facilitates efficient analysis of the separated 
proteins and enables products to be fed directly into additional analysis steps 
directly into mass spectrometry analysis). 
5 In certain embodiments of the present invention, proteins obtained from the 

second separation step are mapped using software (available from Dr. Stephen J. Parus, 
University of Michigan, Department of Chemistry, 930 N, University Ave., Ann Arbor, 
MI 48109-1055) in order to create a protein pattern analogous to that of the 2-D PAGE 
image-although based on the two physical properties used in the two separation steps 
10 rather than by a second gel-based size separation technique. In some embodiments, RP 
HPLC peaks are represented by bands of different intensity in the 2-D image, according 
to the intensity of the peaks eluting from the HPLC. In some embodiments, peaks are 
collected as the eluent of the HPLC separation in the liquid phase. 

In some embodiments, the proteins collected from the second dimension were 
15 identified using proteolytic enzymes, MALDI-TOF MS and MSFit database searching. 
In an example using human erythroleukemia cell lysate, using lEF-NP RP HPLC, 
^proximately 700 bands were resolved in a pi range from 3.2 to 9.5 and 38 different 
proteins with molecular weights ranging from 12 kDa to 75 kDa were identified. In 
comparison to a 2-D gel separation of the same human erythroleukemia (HEL) cell Une 
20 lysate, the lEF-NP RP HPLC produced improved resolution of low mass and basic 

proteins. In addition, the proteins remained in the liquid phase throughout the separation, 
thus making the entire procedure highly amenable to automation and high throughput. 

Certain preferred embodiments are described in detail below. These illustrative 
examples are not intended to limit the scope of the mvention. For example, although the 
25 examples are described using human tissues and samples, ttie methods and apparatuses of 
the present invention can be used with any desired protein samples including samples 
fix)m plants and microorganisms. 

A. lEF-NFRP HPLC Method 
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The following description provides certain preferred embodiments for conducting 
isoelectric separation (first dimension) and NP RP HPLC separation (second dimension) 
according to the methods of the present invention. 



1, ffiF Separation 

Proteins are extracted from cells using a lysis buffer. To facilitate an efficient 



with the downstream separation and 
-TOF-MS) to allow direct use of the 
Such a buffer is an important aspect of 



. process, this lysis buffer should be compatible ' 
analysis steps (e.g., NP RP HPLC and MALDl 
products from each step into subsequent steps. 

10 automating the process. Thus, the preferred bi ffer should meet two criteria: 1) it 
solubilizes proteins and 2) it is compatible wit i each of the steps in the 
separation/analysis methods. Although the present invention provides suitable buffers for 
use in the particular method configurations described below, one skilled in the art can 
determine the suitabiUty of a buffer for any particular configuration by solubilizing 

15 protein sample in the buffer. If the buffer solu jilizes the protein, the sample is run 
through the particular configuration of separation and detection methods desired. A 
positive result is achieved if the final step of tt e desired configuration produces 
detectable information {e.g., ions are detected in a mass spectrometry analysis). 
Alternately, the product of each step in the method can be analyzed to detemiine the 

20 presence of the desired product (e.g. , determining whether protein elutes from the 
separation steps). 

After extraction in the lysis buffer, proteins are initially separated in a first 
dimension. The goal in this step is that the proteins are isolated in a liquid fraction that is 
compatible with subsequent MP RP HPLC and mass spectrometry steps, hi these 

25 embodiments, n-octyl B-D-glucopyranoside (OGl, from Sigma) is used in the buffer, n- 
octyl B-D-glucopyranoside is one of the few detergents that is compatible with both NP 
RP HPLC and subsequent mass spectrometry analyses. It is contemplated that detergents 
of the formula n-octyl SUGARpyranoside find use in these embodiments. The lysis 
buffer utilized was 6M urea, 2M thiourea, 1 .0 % n-octyl B-D-glucopyranoside, 10 mM 

30 dithioeiythritol and 2.5 % (w/v) carrier ampholytes (3.5 to 10 pi)). After extraction, the 
supernatant protein solution is loaded to a device that can separate the proteins according 
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to their pi by isoelectric focusing (BBF). Here the proteins are solubilized in a running 
buffer that again should be compatible with NP RP HPLC. A suitable running buffer is 
6M urea, 2M thiourea, 0.5 % n-octyl fi-D-glucopyranoside, 10 mM dithioerythritol and 
2.5 % (w/v) carrier ampholytes (3.5 to 10 pi). 
5 Three exraiplary devices that may be used for this step are: 

a) Rotofor 

This device (Biorad) separates proteins in the liquid phase according to their pi 
{See e,g.y Ayala et aU Appl. Biochem. Biotech. 69:1 1 [1998]). This device allows for 
10 high protein loading and rapid separations that require only four to six hours to perform. 
Proteins are harvested into liquid fractions after a 5-hour lEF separation. These liquid 
fractions are ready for analysis by NP RP HPLC. This device can be loaded with up to 1 
g of protein. 

15 b) Carrier Ampholyte based slab gel lEF separation with a whole 

geleluter 

In this case the protein solution is loaded onto a slab gel and the proteins separate 
in to a series of gel-wide bands containing proteins of the same pi. These proteins are 
then harvested using a whole gel eluter (WGE, from Biorad). Proteins are then isolated 
20 in liquid fractions that are ready for analysis by NPRP HPLC. This type of gel can be 
loaded with up to 20 mg of protein. 

c) IPG slab gel DEF separation with a whole gel eluter 
Here the proteins are loaded onto a immobiline pi gradient slab gel and separated 
25 into a series of gel-wide bands containing proteins of the same pi. These proteins are 
electro-eluted using the WGE into liquid fractions that are ready for analysis by NP RP 
HPLC. The IPG gel can be loaded with at least 60 mg of protein. 

30 

2. Protein Separation by NPRP HPLC 
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Having obtained liquid fractions containing large amounts of pl-focused proteins, 
the second dimension separation is non-porous RP HPLC. The present invention 
provides the novel combination of employing non-porous RP packing materials 
(Eichrom) with another RP HPLC compatible detergent (e.g., n-octyl B-D- 
5 galactopyranoside) to facilitate the multi-phase separation of the present invention. This 
detergent is also compatible with mass spectrometry due to its low molecular weight. 
The use of these types of RP HPLC columns for protein separations as a second 
dimension separation after lEF in order to obtain a 2-D protein separation is a novel 
feature of the present invention. These columns are well suited to this task as the non- 

1 0 porous packing they contain provides optimal protein recovery and rapid efficient 

separations. It should be noted that though several detergents have been mentioned thus 
. far for increasing protein solubility while being compatible with RP HPLC there are 
many other different low molecular weight non-ionic detergents that could be used for 
this purpose. Several important features that allow the RP HPLC to work as a second 

15 dimension are as follows: The mobile phase should contain a low level of a non-ionic 
low molecular weight detergent such as n-octyl B-D-glucopyranoside or n-octyl 6-D- 
galactopyranoside as these detergents are compatible with RP HPLC and also with later 
mass spectrometry analyses (unlike many other detergents); the colximn should be held at 
a high temperature (around 60®C); and the colimm should be packed with non-porous 

20 silica beads to eliminate problems of protein recovery associated with porous piackings. 

3. Protein Detection and Identification via Mass Spectrometry 
In some embodiments of the present invention, the products of the second 
separation step are further characterized using mass spectrometry. For example, the 

25 proteins that elute from the NP RP HPLC separation are analyzed by mass spectrometry 
to determine their molecular weight and identity. For this purpose the proteins eluting 
from the separation can be analyzed simultaneously to determine molecular weight and 
identity. A fraction of the effluent is used to determine molecular weight by either 
MALDI-TOF-MS or ESI oa TOF (LCT, Micromass) {See e.g., U.S. Pat. No. 6,002,127). 

30 The remainder of the eluent is used to determine the identity of the proteins via digestion 
of the proteins and analysis of the peptide mass map fingerprints by either MALDtTOF- 
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MS or ESI oa TOF. The molecular weight 2-D protein map is matched to the appropriate 
digest fingeiprint by correlating the molecular weight total ion chromatograms (TIC's) 
with the UV-chromatograms and by calculation of the various delay times involved The 
UV-chromatograms are automatically labeled with the digest fingerprint jfraction number. 
5 The resulting molecular weight and digest mass fingerprint data can then be used to 
search for the protein identity via web-based programs like MSFit (UCSF). 

4. Automation 

All of the above described steps are automated, for example, into one discrete 

1 0 instrument. In one illustrative embodiment, the first dimension is carried out by a 
Rotofor, with the harvested liquid firactions being directly applied to the second 
dimension non-porous KP HPLC apparatus through the appropriate tubing. The products 
fi-om the second dimension separation are then scanned and the data interpreted and 
displayed as a 2-D representation using the appropriate computer hardware and software. 

15 Alternately, the products from the second dimension fi-actions are sent through the 

appropriate microtubing to a mass spectrometry pre-reaction chamber where the samples 
are treated with the appropriate enzymes to .prepare them for mass spectrometry analysis. 
The samples are then analyzed by mass spectrometry and the resulting data is received 
and interpreted by a processor. The output data represents any number of desired 

20 analyses including, but not limited to, identity of the proteins, mass of the proteins, mass 
of peptides fi-om protein digests, dimensional displays of the proteins based on any of the 
detected physical criteria (e.g., size, charge, hydrophobicity, etc.), and the like. In 
preferred embodiments, the protems samples are solubilized in a buffer that is compatible 
with each of the separation and analysis units of the apparatus. Using the automated 

25 systems of the present invention provides a protein analj^is system that is an order of 
magnitude less expensive than analogous automation technology for use with 2-D gels 
(See e.g., Figeys and Aebersold, J. Biomech. Eng. 121:7 [1999]; Yates, J. Mass 
Spectrom., 33:1 [1998]; and Pinto et al. Electrophoresis 21 :181 [2000]). 

30 

5. Software and Data Presentation 
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The data generated by the above Usted techniques may be presented as 2-D 
images much like the traditional 2-D gel image. In some embodiments, the 
chromatograms, TIC's or integrated and deconvoluted mass spectra are converted to 
ASCII format and then plotted vertically, using a 256 step gray scale, such that peaks are 

5 represented as darkened bands against a white background. The scale could also be in a 
color format. The image generated by this method provides information regarding the pi, 
hydrophobicity, molecular weight and relative abundance of the proteins separated. Thus 
the image represents a protein pattern that can be used to locate interesting changes in 
cellular protem profiles in terms of pi, hydrophobicity, molecular weight and relative 

10 abundance. Naturally the image can be adjusted to show a more detailed zoom of a 

particular region or the more abundant protein signals can be allowed to saturate thereby 
showing a clearer image of the less abundant proteins. This information can be used to 
assess the impact of disease state, pharmaceutical treatment, and environmental 
conditions. As the image is automatically digitized it may be readily stored and used to 

1 5 analyze the protein profile of the cells in question. Protein bands on the image can be 
hyper-linked to other experimental results, obtained via analysis of that band, such as 
peptide mass fingerprints and MSFit search results. Thus all information obtained about 
a given 2-D image, including detailed mass spectra, data analyses, and complementary 
experiments , immuno-affinity and peptide sequencing) can be accessed firom the 

20 original image. 

The data generated by the above-listed techniques may also be presented as a 
simple read-out. For example, when two or more samples are compared (SeCy Section J, 
below), the data presented may detail the difference or similarities between the samples 
(e.g., listing only the proteins that differ m identity or abundance between the samples). 

25 In this regard, when the differences between samples (eg., a control sample and an 
experimental sample) are indicative of a given condition (g.g., cancer cell, toxin 
exposure, etc.), the read-out may simply indicate the presence or identity of the condition, 
hi one embodiment, the read-out is a simple +/- indication of the presence of particular 
proteins or expression patterns associated v/ith a specific condition that is to be analyzed. 

30 

6. lEF-NP RP HPLC in Operation 
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The lEF-NP RP HPLC image shown in Figure 1 is a digital representation of a 2- 
dimensional separation of a whole cell protein lysate from a human erythroleukemia 
(HEL) cell line. This image is designed to offer the same advantages of pattern 
recognition and protein profiling that may be obtained using a 2-D gel. The horizontal 
5 and vertical dimensions are in terms of isoelectric point and protem hydrophobicity, 
respectively. The isoelectric focusing step, performed using the Rotofor, resulted in 20 
protein fractions ranging in pH from 3.2 to 9.5. These fractions were flien injected onto a 
non-porous reversed phase column for separation by HPLC and detection by UV 
absorbance (214 nm). The resulting chromatograms were converted to ASCII format and 
10 then plotted vertically, using a 256 step gray scale, such that peaks are represented as 
darkened bands against a white background. Protein profiles may be viewed in greatCT 
detail by using the zoom feature as shown in Figure 2 and/or by selecting a particular 
Rotofor fiaction and observing the NP RP HPLC chromatogram as shown in the left 
panel of Figure 2. The zoom and chromatogram image features provide a means to 
1 5 observe details in band patterns that may not be observable ua the original image {See, 
Figure 1). In addition, because of the limitations of the 256 step gray scale representation 
the band intensities in areas 1, 2 and 3 of Figure 1 were rescaled by a factor of 3 to better 
show the low abundance proteins. This was preferred since the presence of several high 
abundance protem bands may cause low intensity bands in some regions to be 
20 undetected. In Figure 1 , the total peak area for each individual chromatogram was scaled 
to reflect the relative amount of protein that was found in the original Rotofor fraction 
{See, Figure 3). The band intensities in different chromatograms can therefore be 
compared, directly thus providing a true image of relative protein abundance in the cell 
lysate. The width of the Rotofor fraction columns was adjusted to represent their 
25 estimated pH range. Themolecularweight of proteins observed by lEF-NPRP HPLC 
ranged from 12 kDa to 75 kDa. Typical NP RP HPLC separations, as shown in Figure 4, 
resulted in 35 peaks in 10.5 minutes. The total number of peaks that could be observed 
from all 20 fractions is estimated to be jqjproximately 700. 

The gradient time (to) used in the above experiments is very short and a 
30 significant increase in peak capacity is expected with longer gradients. This is shown 
using Rotofor fraction 17 where two separations were performed with gradient times of 
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10.5 minutes (See, Figure 5A) and 21 minutes {See, Figure 5B). With Xq = 10.5 minutes, 
the average peak width was 0.14 minutes and the peak capacity was therefore 75. The 
actual number of peaks resolved was 35. With tQ = 21 minutes the average peak width 
was 0.23 minutes and the peak capacity was therefore 91. The actual number of peaks 

5 resolved was 51. Using the longer sqjaration time with tQ = 21 minutes the total number 
of peaks observed should increase from 700 to 1000. HowevCT, it should be noted that 
when using mass spectrometric detection, that sufficient resolution should be available to 
ultunately resolve the same number of peaks without using a longer gradient time. 

The proteins in a representative sampling of these peaks were identified using the 

10 traditional approach of enzymatic digestion, MALDI-TOF MS peptide mass analysis and 
MSFit database searching. The magnification of the lEF-NP RP HPLC image enables 
the viewer to perceive more bands than is possible to observe firom the whole image. In 
addition, as shown in Figure 2, the viewer may select a particular band format 
chromatogram and observe the traditional peak format of the chromatogram in a window 

15 to the left of flie unage. This allows the observer to use the peak format chromatogram to 
find partially resolved peaks that may not be observable in the band fonnat 
chromatogram. Five standard protein bands are shown in the left-most column where the 
masses range from 14.2 kDa up to 67 HDa. As RP HPLC separates protems by 
hydrophobicity, these standards are not molecular weight markers as in a traditional 1-D 

20 gel. Rather, they are used to indicate the range of protein molecular wd^ts fliat may be 
observed. Ten different proteins are labeled on the image although many more proteins 
were identified as shown in Table 1, below. In some embodiments of the present 
invention, where it is desired that certain proteins or classes of proteins are to be detected, 
the starting protein sample may be selectively labeled. After the proteins are passed 

25 through the separation step, detection of the proteins can be limited to those that contain 
the selective label. 

B. Protein Separation by 2-D SDS PAGE 

The image in Figure 1 represents the lEF-NP RP HPLC separation of the HEL 
30 cell protein lysate and the image in Figure 6 represents the Coomassie blue (CBB) stained 
2-D SDS PAGE separation of flie same HEL cell line lysate. The pi range for Ms gel is 
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the same as that used for the Rotofor separation and the molecular weight range is from 8 
kDa to 140 kDa. As with the lEF-NP RP HPLC separation a representative sampling of 
the isolated proteins was identified using enzymatic digestion, MALDI-TOF MS and 
MSFit methods {See e.g., Rosenfeld et al., Anal. Biochem. 203:173 [1992]). For the 

5 target protein mass range ofthis study (10 kDa -70 kDa) approximately 188 protein 
spots are observed on the CBB stained gel, 355 from the CBB stained polyvinylidene 
difluoride (PVDF) blot, and 652 from the silver stained gel as estimated using Biolmage 
2D Analyzer Version 6.1 software (Genomic Solutions). The total spot capacity for the 
2-D gel s^aration is estimated to be 2100. The proteins identified from the gel are 

10 labeled on the image and also shown in Table 2, below. Aniinageof another 2-D gel 
sq)aration of HEL cell proteins can be observed via the Swiss-2D PAGE database (See 
e.g., ht)i)://www.expasy.ch; Sanchez etal.. Electrophoresis 16:1131 [1995]). In addition, 
it is possible to view the latest protein Ust for the HEL cell in which 19 protein entries are 
shown (&e e.g., http:/Avww.e3q)asy.ch/cgi-bin/get-ch2d-table.pl). 
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T^blc l.Thlrty Eight Proteins Identified Prom HELCcll lEF-NP RP HPLC Separation 
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NUCLBARRIBONUCLfiDPROTEINS A2/BI 
SMn-AtPKA HOMOlOG 
ALPHA ENOLASE 
PPIASE 

Kl GAMMA4 CHAINCREGION 
CLYCERAlJDEHYDE3-PHO^HATE 
ALPHA ENOLASE 
SM22-ALPHA HOMOUOG 
NUCLEAil RIBONUCLGOPROTEIN Al 
HUCLEAR RIBONUCLBOPROTEINS A2/B I 
BETA ENOLASE 

OLYCERAU3BIIYDE3-rHOSPHATE 
ANNEXINn 

SM22WUJPHA HOMOLOG 

pro. NUCLEAR lUBONUCLeOPROmN 1 

NUCLEAR RIBONUCL8OPR0TEIN Al 

HUCtEAR RIBONUCLEOPROTBINS A2AI 

FRUCroS&aiSPHOSPKATE ALDOLASE A 

GLYCERALOEHYDE 3-PHOSPHATE 

ALPHA ENOLASE 

SM2ir ALPHA KOMOlXXi 

PHOSPHOGLYCXRATE KINASE 1 

PTB, HUCLEAR RmONUCLEOPROTEIN 1 
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Table 2, 



OelSpotl. 
Number 



Nine Proteins Ideatlfied From HEL Cell CBB 2>D Gel 

Enzyme MWt / pi: database SwissProl Protein Name 





trypsin 


g2 


trypsin 


g3 


trypsin 


g8 


trypsin 


glO 


tiypsin 


gn 


tiypsin 


gl2 


tiypsin 


gl3 
gI4 


trypsin 



calculated 



Accession # 



18012,6 / 7.68 Pft^092 PPIASE 

26669.6 / 6,45 pooQ:^« TRIOSEPHOSPHATE ISOMERASE (TIM) 
26669,6/6.45 Pfl0938 TRIOSEPHOSPHATE ISOMERASE (TIM) 

29032.8 / 4.75 TROPOMYOSIN, CYTOSKELETAL TYPE fTMSO-NM) 

32575,2 / 4.64 £06748 NPM 

41737.0/5,29 P0^S7Q BETA-ACTIN 

61055.0/5.70 £10809 HSP-60 

56782.7 / 5.99 E30101 ERP60 

47169,2/7,01 Pflr.7^:^ ALPHA ENOLASE 
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C. lEF-NPRPHPLC versus 2-DSDS PAGE: Protein Loading and 
Quantification 

Each separation method relies upon orthogonal mechanisms of separation 
generating a large number of isolated proteins. Protein profiles may be compared in 

5 terms oftheir pattern as well as the relative amounts of isolated proteins. It is shown, 
however, that the loadabiUty of the liquid phase methods of the present invention greatly 
surpasses that of the gel phase. 

The Umit of detection for the gel method when stained with the silver stain is 
approximately 1 to 10 ng. The Coomassie blue stain can detect 100 ng of protein and the 

10 amount of protein in the spot can be quantified over 2.5 orders of magnitude. For the NP 
RP HPLC of standard proteins \ised m certain embodiments of the methods of the present 
invention, the limit of detection for the UV detector was 10 ng. The protem in the peak 
can be quantified fi-om 10 ng up to 20 ng providing 3.1 orders of magnitude. 
Quantification of an HPLC peak involves integrating the peak to find the area. For the 

15 gel, the spots must first be digitized and then this image must be analyzed to determine 
. the integrated optical density of each spot of interest. The sensitivity of the UV detector 
in embodiments of the present invention utilizing HPLC is competitive with the silver 
stain and quantification is much simpler. The limits of detection for both the silver 
stained gel and the HPLC UV peak detection are mass dependent. For the gel, resolution 

20 and sensitivity are proportional to the molecular weight of the protein. For lEF-NP RP 
HPLC, the resolution and sensitivity are inversely proportional to the molecular weight of 
the protein. The gel appears to provide improved results for both acidic proteins and 
proteins above 50 kDa whereas DBF-NP RP HPLC performs better vydth proteins in the 
basic region and proteins that are below 50 kDa {See e.g., Figure 1 and Figure 6). These 

25 results show the complementary nature of these two techniques whore the gel and lEF- 
NP RP HPLC each provide important information of protein content. 

In one experimoit using the methods of the present invention, 23.5 mg of protein 
was loaded mto the Rotofor, and after a five-hour lEF separation period firactions ranging 
from 2 to 4 mL were collected into polypropylene microtubes. The amount of protein in 

30 the individual fractions ranged from 025 mg to 1.05 mg. Summing the amounts of 
protein in each fraction led to the determination that a total of 10.2 mg of protein was 
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recovered from the Rotofor. This amount can be increased by increasing the amount of 
non-ionic detergent in the Rotofor buffer above the current 0,1% level as well as by the 
addition of thiourea. In contrast, the amount of protein loaded on the 2-D gel in Figure 6 
is 200 ^g. The amount of protein that actually makes it through the gel and focuses to a . 
5 spot has not been quantified, relative to the amount of protein that is actually loaded on 
the gel, though it is known that many hydrophobic proteins are lost during the separation 
(Herbert, Electrophoresis 20:660 [1999]). The amount of protein that may theoretically 
be loaded on a gel ranges from 5 ]ig up to 250 jig whweas for EF-NP RP HPLC the 
initial loading of protein may be as high as 1 gram. The amount of protein actually used 
0 to produce the separation shown in Figure 1 is only a fraction of the amount initially 
loaded into the Rotofor. The image in Figure 1 actually represents the separation of a 
total of 1 to 2 mg of protein though 10,2 mg of protein was recovered from the Rotofor. 
The loading of the HPLC column being used currently could be increased though the 
peak edacity may suffer. Alternatively a larger colunm could be used in series with the 
15 smaller column to allow for higher loadabihty with no loss of separation efiSciency (See 
e.g., Wall e/fl/.. Anal. Chem., 71:3894 [1999]). 

A 2-D gel provides a two dimensional separation from one mitial loading of the 
cell lysate. The intensities of different spots on the same gel are representative of the 
relative protein abundances in the original lysate. However, in the lEF-NP RP HPLC 
20 methods of the present invisntion the proteins are loaded for the lEF and the HPLC 

separations so that the band intensities in the 2-D lEF-NP RP HPLC image depend on the . 
amount of protein loaded to the HPLC from each Rotofor fraction. Since the amount of 
material in each Rotofor fraction is different, the total area of each chromatogram was 
scaled to represent the total amount of protein that was recovered for each Rotofor 
25 fraction (&e, Figured). The result is that the protein band intensities can be coinpared 
both within the Rotofor fraction and between the different fractions. 

In some embodunents of the present invention, 2-D gel techniques are used side- 
by-side with EF-NP RP HPLC. In embodunents where specific proteins are desired for 
further characterization, the gel can provide information indicating which fraction 
30 obtained with lEF-NP RP HPLC contains the desfred protein or proteins. 
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D. Isoelectric Focusing: Liquid vs. Gel Phase 

The principal, concern with liquid phase lEF is that the protein is not 
isoelectrically focused as effectively as it would be in a gel due to difiusion of the protein 
in solution. In the case of -enolase, if one compares the liquid and gel phase images, it 

5 can be seen that in both cases substantial spreadmg of the protein occurs over a wide pi 
range. This range spans from pi 6.5 to pi 9.5 in both the liquid phase and the gel phase. 
For more acidic proteins such as fl-actin, it appears that m the liquid phase the protein is 
more dispersed in the pi dimension than for the corresponding gel separated protem. 
Both methods provide a reasonably accurate assessment of flie pi of the protein of 

10 interest. Refemng to Table 1 , it can be seen that as the Rotofor fraction pH increases, so 
generally does the pi of ideutified proteins therein. The pH of fraction 3 measures 4.2 
and the protdns identified from this fiction range in pi from 4.09 to 5.7. The pH of 
fraction 9 was 5.8 and the proteins identified from that fraction ranged from 5.29 to 6.45. 
The pH of fitUJtion 16 was 7.2 and the pi range of proteins found there ranged from 7.01 

15 to 8.93. The pi accuracy therefore ranges from +/- 0,65 to 1 .73 pi units. This is 

comparable to the carrier aiig)holyte based gel. It should be noted that the pi of a given 
protein limy vary significantly due to post-translational modifications such as 
phosphorylation and glycosylation, as well as to artifactual modifications such as 
carbamylation and oxidation. 

20 

E. Second Dimension Liquid Separation 

Fraction 16, Figure 4, may be used as an example of the quantification of isolated 
proteins. For fraction 16, the volume of injection was 160 jiL. This means that if the 
concentration of protein was 201.4 jig/mL then the amount of protein loaded was 32.2 

25 \xg. The chromatogram was integrated using Microcal Origin software and the total area 
was determined to be 97.78. The areas of peaks 16E and 16J were 3.68 and 5.41 
respectively. Dividing the peak area by the total area gives the fraction ofprotein 
represented by the peak. Therefore, if one assumes 100% protein recovery, the amount 
of PPIASE (16E, tR = 5.68) in 16 was (0.0376 * 32.2 1.21 jig and the amount of - 

30 enolase (16J. tR = 7.45) was (0.0553 * 32.3 fig) 1.78 [ig. The peak areas were generated 
by absorbance of 214 nm light at die amide bonds of the proteins and so should offer low 
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selectivity thereby allowing for a good measure of the amount of protein in the peak 
regardless of the type of protein. 

Figure 4 shows how the continuous integration of the chromatogram maybe used 
to estimate the amount of protein isolated in a given peak. The peak area line is simply 

5 converted mto mass units from which the observer can measure the change in the vertical , 
mass axis that occurs over the width of the peak of interest. If one knows the initial 
concentration of protein in the cell lysate and the number of cells that were lysed, a 
quantitative comparison of different ceU lysates can be made. This comparison is 
important to studying changes in protein expression levels due to some disease state or 

10 pharmacological treatment. In gel work, a technique used for protein quantification in 
different samples is to normalize the .integrated optical density of the spot of interest to 
that of standard proteins whose expression levels are thought to be constant. In this way 
any experimental variation in spot intensity can be corrected. This same method is 
applied to the lEF-NP RP HPLC image to allow for reliable quantification of proteins of 

1 5 interest such that changes in expression level are quantitatively observed. 

The assumption in these experiments is 100% protem recovery. One can 
detennine the actual % recovery of protein and the dependence on elution time. Typical 
protein recoveries have been shown to range from 70 to 95% in NP RP HPLC (Wall et 
a/., Anal. Chem., 71 :3894 [1999]) and so, with a more likely percent recovery of 80%, 

20 the amount of PPIASE and -enolase in fraction 16 would be estimated to be 1 .0 jig and 
1.42 ^g, respectively. 

F. Rotofor Fraction Analysis by NP RP HPLC vs. 1-D SDS PAGE 

NP RP HPLC provides highly efBcient protein separations (See e.g,, Chen et al, 
25 Rap. Comm. Mass Spec, 12:1994 [1998]; Wall et al. Anal. Chem., 71:3894 [1999]; and 
Chong et al. Rap. Comm. Mass Spec, 13:1808 [1999]), and is a far easier method to 
automate as compared to gels in terms of injection, data processing and protein 
coUectioa In addition the NP RP HPLC separations provided by the present invention 
are 70 times faster than the equivalent separation by l-D SDS-PAGE, which requires 14 
30 hours. In the experiments described above, the NP RP HPLC method has greater 
resolving power generating 35 bands where the 1-D gel generates only 26 bands. A 
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direct comparison of the two methods, as shown in Figure 7, reveals that the NP RP 
HPLC bands are mnch narrower than those of the 1-D SDS PAGE over a similar 
molecular weight range. Also it is clear that as molecular weight decreases, the 1-D gel 
bandwidth mcreases substantially. In NP RP HPLC the opposite trend occurs where the 

5 lower molecular weight proteins show improved resolution and sensitivity. This image 
may appear to show that the NP RP HPLC separation fails with larger proteins, as there 
are few bands in the upper region of the image. However, this is not the case as it is 
important to remember that the vertical dimension for NP RP HPLC is not protein 
molecular weight but rather protein hydrophobicity. This is evidenced by the observation 

10 of the elution of bovine serum albumin (66 kDa), a relatively hydrophilic protein, half 
wayi?)anmiage. 

G. Elution Time Prediction for Known Target Protein 

One of the advantages of the 2-D gel is that the vertical coordinate of the gel may 

15 be used to estimate the molecular weight of the protein with a +A 10% error. The 

position of a protein of interest can therefore be estimated before the protein is identified 
from the gel. In an attempt to correlate elution time in the methods of tiie present 
invention with the mass of the protein, a linear fit to a plot of percent acetonitrile at time 
of elution (%B) versus the log(MWt)/protein polar ratio was generated. The polar ratio 

20 (PR) is the number of polar amino acids divided by the total number of amino acids in tiie 
protein and the molecular weight is in kDa. The proteins used for this plot were four of 
the standards listed in Figure 1 as well as a sampling of six of tiie protems from Table 1 
(HSP60, 6-actin, TIM, -enolase, PPIASE and glyceraldehyde-S-phosphate). The 
resulting equation (equation 1: %B/100 = 0.079805*aogMWt)/PR + 0.077686, (R = 

25 0.9677, SD = 0.014722, N = 7)) is used to predict the elution time of target proteins. For 
HSP60, B-actin and -enolase the experimental elution times were 10.28, 10.15 and 7.25 
respectively. The predicted elution times were 10.20, 10.13 and 9.78. In the cases of 
HSP60 and B-actin the prediction works well, whereas for -enolase tiie prediction is not 
as good. While not precise, this prediction does give some idea of when a protein will 

30 elute such tiiat a given target protein, for which tiie niolecular weight and hydrophobicity 
are known, can be found more readily. 
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H. Protein Identification by Enzymatic Digestion, MALDI-TOF MS and 
MSFit Database Searching 

The proteins that were identified firom a representative sampling of the bands 
ftom the lEF-NP RP HPLC separation are Usted in Table 1. A sampling of 

5 approximately 80 proteins firom 12 of the Rotofor firactions were digested and their 

peptide mass maps successfixlly obtained by MALDI-TOF MS. Of these 80, 38 different 
proteins were identified. In this case, identifymg roughly 50% of the proteins searched is 
to be Mpected as not all the proteins are in the available databases. Similar results were 
observed for proteins analyzed torn 2-D gels of the HEL cell samples. The current table 

10 in Swiss-2D PAGE lists 1 9 protein entries for the HEL cell. Of these 1 9 proteins, five 
were identified firom the lEF-NP RP HPLC separation. In the gel, these same five 
proteins were also identified. 

Li general, it appears that the gel MSFit results are better than those firom the 
liquid phase. This can be attributed to the fact that the gel proteins were reduced and 

1 5 alkylated with DTE and iodoacetamide respectively prior to the nmmng of the second 
dimension. This step would help insure that all disulfide bonds are broken and optimal 
proteolysis is produced. Thus, this derivatization step can be added to the lEF-NP RP 
HPLC method, by performing the reduction and alkylation step prior to NP RP HPLC or 
during cell lysis. Nevertheless, in some cases the lEF-NP RP HPLC digestions surpassed 

20 those fi-om the gel in coverage and quality. This is evidenced in Figure 8, which shows a 
direct comparison of the MALDI-TOF MS for -enolase as isolated via the lEF-NP RP 
HPLC method and the gel method. These mass spectra were calibrated extemally at first 
and the mass profiles used to search the Swiss protein database with a mass accuracy of 
400 ppm. These searches gave strong hits to -enolase for both the gel and the liquid 

25 protein digests. Each mass spectrum was then recalibrated intemally using matched 
peptide peaks firom the initial extemally calibrated match. The new peak table was then 
used to search the same Swiss protein database but with 200 ppm mass accuracy. Figure 
8 clearly shows that the digestion firom the liquid phase is improved compared to that 
fi-om the gel. The lEF-NP RP HPLC mass spectrum matches to 60% of the protein 

30 sequence whereas that from the gel matches to 49%. Achieving a match to 60% of the 
sequenceofa47 kDa protein is very unusual for MALDI-TOF MS analysis and 
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represents a significant improvement over gel digests. Although the preset invention is 
not limited to any particular mechanism, the mcrease in sequence coverage may be due to 
the fact that the protein is digested in the liquid phase, is relatively pure, and because the 
peptides are not lost due to being embedded inside the gel piece. Also if one observes the 
5 level of methionine oxidation in the peak that matches to T163-179, it is clear that the 
protein isolated by lEF-NP RP HPLC is far less oxidized than that from the gel. 

Many of the MP RP HPLC chromatograms contain some peaks that are not fully 
resolved to baseline. This need not be a problem as partially resolved proteins can still be 
effectively identified using MALDI-TOF MS analysis. In Rotofor fraction 3 there are 
10 peaks at 10.15 minutes and 10.25 minutes {See, Table 1). These peaks are only resolved 
to 50% above the baseline and yet it is clear that the peak eluting at 10.15 minutes is fl- 
actin and the peak eluting at 1 0.25 minutes is HSP-60. Note Hiat the predicted elution 
times for these proteins are 10.13 and 10.20 minutes respectively. As proteins can be 
identified from partially resolved peaks, faster separations witii more rapid gradients are 
15 possible. The reproducibility of the pattern of bands can be determined by looking at the 
retention times for particular proteins as observed from diflFerent Rotofor fractions. B- 
actin elutes at 10.15 minutes in both fractions 3 and 9; -enolase elutes at 7.25, 7.45 and 
7,39 minutes in fractions 12, 16 and 20 respectively, and HSP-60 elutes at 10.28 and 
10.25 minutes in fractions 3 and 4 respectively. Clearly, witii +/- 0.1 minutes variation m 
20 the retention times, these separations are quite reproducible from run to run. 

Thus, the methods of the present invention have been shown to provide 
advantageous mefliods for the reproducible separation of large numbers of proteins. In 
the human erythroleukemia cell lysate example, the methods are capable of resolving 700 
bands with a rapid gradient, and 1000 bands with a longer gradient. There were 38 
25 different proteins tentatively identified, by MAIJ5I-T0F MS and MSFit database 

searching, after analysis of a fiction of these bands. This compares favorably with the 
19 different proteins that have been identified to date from the 2-D gel. Some of the 
proteins found in the human erythroleukemia cell lysate; including -enolase 
(Rasmussen et a/.. Electrophoresis 19:818 [1998] and Mohammad et a/., Enz. Prot, 
30 48:37 [1994]), glyceraldehyde-3-phosphate dehydrogenase (Bini et al. Electrophoresis 
18:2832 [1997] and Sirover, Biochim. Biophys. Acta 1432:159 [1999]), NPM (Redner et 
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al. Blood 87:882 [1996]), CRKL (ten Hoeve et al, Oncogene 8:2469 [1993]), and heat 
shock protein (HS27) (Fuqua et al. Cancer Research 49:4126 [1989]), have been Unked 
to various fonns of cancer. NPM and CRKL have been linked specifically to lenkenaias. 
The proteins identified in one exemplary experiment ranged firom 12 kDa up to 75 

5 kDa (alfliough broader ranges are contemplated by the present invention); this range may 
mclude many of the proteins of interest to current research involving protein profiling, 
identification and correlation to some disease state or cell treatment, hi sharp contrast to 
2-D gels, ttris method is well-suited to automation. Mass spectrometric methods can be 
applied, such as ESI-MS and MAIDI-TOF MS, to the detection of whole proteins and 

10 protein digests. Most importantly, the methods of the present invention provide an 

altemative 2-D protein map to the traditional 2-D gel and appears to improve results for 
lower mass proteins and more basic proteins. A key advantage of the Uquid 2-D 
sq)aiation is that the end product is a purified protein in the hquid phase. Also, since the 
initial protein load can be fifty times that of the gel, the amount of a target protein that 

15 may be isolated by oiie lEF-NP RP HPLC separation is potentially fifty times higher than 
that obtainable fiom a 2-D gel separation. Additionally, in the case that the investigator 
is interested in specific proteins where the pi is known, this method may be used to 
isolate and identify the target protein in less than 24 hours, since only the firaction of 
interest need be analyzed via the second dimension separation. The gel-based method 

20 would require three days to achieve the same result 

L Identification of Novel Tumor Antigens 

There is substantial interest in identifying tumor proteins that are immunogenic. 
Autoantibodies to tumor antigens and the antigens themselves represent two types of 

25 cancer markers that can be assayed in patient serum and other biological fluids. lEF-NP 
RP HPLC-MS has been implemented for the identification of tumor proteins that ehcit a 
humoral response in patients with cancers. The identification of proteins that specifically 
react with sera from cancer patients was demonstrated using this approach. Solubilized 
proteins from a tumoral cell Une are subjected to lEF-NP RP HPLC-MS. Individual 

30 fiactions defined on the basis of pi range are subjected simultaneously to one- 
dimensional electrophoresis as well as to HPLC. Sera from cancer patients are reacted 
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with Western blots of one-dimensional electrophoresis fractions. One band which 
reacted specifically with sera from lung cancer patients and not from controls was found 
to contain both Annexin II and aldoketoreductase. The ability to subfractionate further 
proteins contained m this fraction by HPLC led to the identification of Annexin II as the 
5 tumor antigm that elicited a humoral response in lung cancer patients. 

J. Comparative Analysis 

As is clear from the above description, the methods of the present invention offer 
the opportunity to compare protein profiles between two or more samples (e.g., cancer vs. 

10 control cells, undifferentiated vs. differentiated cells, treated vs. untreated cells). Li one 
embodiment of the present invention, the two samples to be compared are run in parallel. 
The data generated from each of the samples is compared to determine differences in 
protein expression between the samples. The profile for any given cell type may be used 
as a standard for deteraiining the identity of fiiture unknown samples. Additionally, one 

15 or more proteins of interest in the expression pattern may be finiher characterized (e.g. , to 
detennine its identity). In an alternative embodiment, the proteins from the samples are 
run simultaneously. In these embodimaits, the proteins from each sample are separately 
labeled so that, during the analysis stage, the protein expression patterns fix>m each 
sample are distinguished and displayed. The use of selective labeling can also be used to 

20 analyze subsets of the total protein population, as desired. 

As is clear from the above description, the methods and compositions of the 
present invention provide a range of novel features that provide improved methods for 
analyzing protein expression patterns. For example, the present invention provides 
methods that combine lEF, resulting in pl-focused proteins in liquid phase flections, with 

25 nonporous RP HPLC to produce 2-dimensional liquid phase protein maps. The data 
• generated from such methods may be displayed in novel and usefiil formats such as 
viewing a coUection of different pi NP RP HPLC chromatograms in one 2-D image 
displaying the chromatograms in a top view protein band format, not the traditional side 
view peak format. As shown in Figure 2, the side view peak format is shown to flie left 

30 and the top view band format is shown to the right The present invention also provides 
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detergents that are compatible with automated systems employing multi-phase separation 
and detection steps. 

The present invention provides additional characterization steps, including the 
identification of proteins separated by DEF-NP RP HPLC using enzymatic digestions and . 
5 mass spectrometric analysis of the resulting peptide mass fingerprints. Proteins may be 
detected to determine their molecular weights by analyzing the effluent from the HPLC 
with either off-line collection to a MAIDI plate (Perseptive) or on-line analysis using 
orthogonal ^traction time-of-flight. The data generated from such methods may be 
displayed in novel and useful formats such as using the data from the MALDI or LCT 
10 generated protein molecular weights to generate total ion chromatograms (TIC) that 

would be virtually identical to the original UV-absoibance chromatograms. The signal of 
these chromatograms would be based on the number of ions generated from the HPLC 
effluent of a given group of pl-focused proteins, not by absorption of light These 
chromatograms are plotted in the same 2-D top view band format as mentioned above. 
15 These methods allow one to fully integrate and deconvolute each of the TIC's generated 
to display complete mass spectra of each collection of pl-focused proteins. The methods 
also allow the display of all the integrated TIC's in one 2-D image where the vertical 
dimension is in terms of protem molecular weight and the horizontal dimension is m 
terais of protein pL The protein mass spectra appears as bands as they are also viewed 
20 from the top. This image would therefore also contain quantitative mformation (in the 
case of the LCT) and so the bands would vary in intensity depending on the amount of 
protein present. 

The Uquid phase methods for protem mass mapping would also allow for 
collection of protein flections to microtubes such that the proteins could be digested and 

25 the peptide mass maps analyzed to detennine the identity of said proteins simultaneously. 
Laser induced fluorescence (LDF) detection schemes are used in conjunction with this 
method to increase the overall sensitivity by three orders of magnitude. The Uquid phase 
LIF detector provides more sensitive fluorescence detection than in the gel as there would 
be no gel background fluorescence. This LIF detection method could be used m a 

30 number ofways including, but not limited to: 
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1) Combining equal amounts of two cell lysates that have 
each been previously stained \with a different fluorescent dye 
followed by use of a dual fluorescence detector to simultaneously 
detect the same proteins fix)m two dififerent cell lysates. This 

5 would allow for very accurate comparisons of the relative amounts 

of proteins foimd for different cell lines or tissues; and 

2) Using a fluorescently tagged antibody to label specific 
target proteins in a cell lysate such that fliey can be targeted for 
thorough analysis witiiout looking at all the other proteins. 

10 The methods and ^aratusesofthe present inveaition also ofifer an efficient 

system for combining with other analysis techniques to obtain a thorough 
characterization of a given ceU, tissue, or the like. For example, the methods of the 
present invention may be used in conjunction with genetic profiUng technologies (e.g., 

gene diip or hybridization based nucleic add diagnostics) to provide a fuller 
15 understanding of the genes present in a sample, the expression level of the genes, and the 
presence of protein (e.g., active protein) associated with the sample. 

in. Improved Elntion Techniques Using Chromatofocusing 
As described above, the present invention provides novel liquid chromatographic 
20 methods involving a 2-colunm 2-D separation of proteins from whole cell lysates 

followed by on-line mass mapping with by mass spectrometry (e.g., using ESI-oaTOF 

MS as described in detail below). It is a 3-D protein analysis system as proteins are 
separated based upon, for example, their isoelectric points (pi) in the first LC dimension. 
The present invention fiirther provides novel techniques .for eluting proteins from 

25 a separation ^paratus (e.g., the first phase separation apparatus), For example, in one 
embodiment of this technique, the proteins eluted from the first dimension are peeled off 
from the column according to their pH, either one pH unit or fraction thereof at a time- 
referred to as chromatofocusing (CF). These focused Uquid fractions are then separated 
according to their hydrophobicity and size (or other desired properties) in the second 

30 dimension. Liquid fractions from, for example, NP-RP-HPLC can be conveniently 
analyzed directty on-line using mass spectrometry (e.g., ESI-oaTOF) to obtain their 
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molecular weight and relative abundance, which provides a third dimension. As a result, 
a virtual 2-D protein image is created and is analogous to a 2-D gel image. Furtheraiore, 
this 2-D protein image includes vital information such as the pi, hydrophobicity, 
molecular wei^t, and relative abundance. This Protein Peeling 2-D LC-MS method is 
5 a practical alternative to 2-D gels in order to study protein expression between nonnal 
and disease whole cell lysates, for example. This whole system can be fiiUy automated 
and integrated into a single unit for rapid proteome analysis, providing a more accurate 
and less e:q)ensive automation technology compared to automation technologies for use 
witii 2-D gels. 

10 An exemplary embodiment of the chromatofocusing techmques of the presrat 

invention are provided in Example 7. Data from these experiments is shown in Figures 
14-16. Figure 14 shows tiie CF profile of MCF-lOA whole cell lysate (pH 7 to 4). 
Fractions 1 to 3 were further analyzed with NP-RP-KPLC-ESI-oaTOF MS (described in 
detail below). Figures 15A-C show the NP-RP-HPLC-ESI-oaTOF TIC (total ion count) 
15 profile of the three fractions from Figure 14: (A) firaction 1 (pH 6.75 - 6.55); (B) fraction 
2 (pH 5.50 - 5.25); and (C) fraction 3 (pH 5.20 - 4.90). By integrating and deconvoluting 
the TIC profiles with the MaxEntl software (described in detail below), the mass spectra 
for all three fractions are displayed in a 2-D foriMt as shown in Figure 1 6. Figure 16 
shows the integrated TIC in one 2-D protein map where the vertical column is the 
20 molecular weight while the horizontal dimension is the protein pi point. This map also 
contains the relative abundance information whereby the bands vary in intensity (shades 
of gray) depending on the amount of the protem present. 

The data generated by CF-NP-RP-HPLC-ESI-oaTOF MS can be presented as 2-D 
maps or 2-D images much like the traditional 2-D gel images. For example, in some 
25 embodiments, the chromatograms, TICs, integrated and deconvoluted mass spectra are 
converted into the ASCE format before being plotted vertically, using a 256-step gray 
scale, such that peaks are represented as darkened bands against a white background. 
This scale comes in a variety of color formats. Therefore, this 2-D map provides vital 
information on pi. hydrophobicity, molecular weight as weU as the relative abundance of 
30 separated proteins. This map can also be adjusted by zoom into a specific area of 

interest, for a more detailed image of all the bands therein. AU the information gathered 
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fiom this 2-D map can be used to examine protein expression in a cell system due to the 
disease state, pharmaceutical treatment or environmental change. Since the image is 
automatically digitized, it can be easily stored and the bands can be hyperlinked to other 
experimental results or related data. As a result, all the information is available from the 
5 origfaial image. 

The use of chromatofocusing with the separation, analysis, and display methods 
of the present invention provide a number of important advantages not previously 
available. For example, by combining chromatofocuising with a second separation phase 
(e.g., NP-RP-HPLC) and mass spectrometry analysis, a 2-D liquid phase protein map is 
10 generated which is analogous to a 2-D gel. In preferred embodiments, this is a multi- 
dimensional liquid chromatogr^hy (LC) whereby both chromatographic techniques are 
perfonned on-line (i.e., in an automated fashion) between two or multiple LC units with a 
switching valve to deliver fractions from CF to, for example, NP-KP-HPLC. Proteios are 
peeled off the CF column according to their pH, one pH unit or fraction thereof, at a 
15 time. This peeling feature allows for further focusing of the protein bands at their 

respective pi regions. The protein concentration of each pi band is thus enhanced during 
elution. As with the method described above, buffers can be used that are compatible 
with each stq> of the process. For example, in some embodiments, the sample 
preparation and CF separation involves the use of guanidine-hydrochloride and a 
20 nonionic detergent (e.g., n-octyl -D-glucopyranoside) that is compatible with the NP- 
RP-HPLC and ESI-oaTOF MS. 

The present invention may be used in developing protein microarrays. In 
preferred embodiments, the present invention provides a new method for preparing large 
numbers of intact proteins from cell lysates for microchip or other solid surface array 
25 analysis. This method involves a 2-D (or more) fractionation of proteins from a cell 
lysate. In particularly preferred embodiments, the 2-D fractionation of proteins &om a 
cell lysate first involves the separation of intact proteins fiom cell lystates using 
chromatofocusing. Next, each fraction is further purified with non-porous silica RP- 
HPLC. This preferred embodiment yields a 2D-liquid phase firactionation of the proteins 
30 from a ceU lysate, at a relatively high purity, hi addition, this preferred embodim^^ 

pennits the spotting onto a microarray of thousands of isolated proteins in a liquid phase. 
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An exemplary embodiment of developing protein microarrays with chromatofocusing 
and HPLC techniques is provided in Example 1 1 . 

Microairayed proteins find many uses. For example, the present invention may 
be used in studying a humoral response, hi preferred embodiments, the present invention 

5 provides a new method for identi^g sero-diagnostic markers for disease (e.g., cancer). 
In further preferred embodiments, the present invention provides a new method for 
identi^g sero-diagnostic markers for prostate cancer. The fractionated proteins can be 
spotted on a nitrocellulose shde or any other suitable surface and used in the study of a 
humoral response by exposing it to sera from cancer patients and normal individuals (or 

10 from any two samples to be coinpared). This mefliod allows for comprehensive analj^s 
of the cancer proteome using very small amounts of analyte obtained by fractionation. 
Inq>ortantiy, this method allows for use of post-translationally modified proteins as baits 
for detection of humoral response. Proteins eliciting an immune response are identified 
using, for example, the molecular mass and peptide sequMice data obtained using mass 

15 spectrometric analysis of Uquid fractions. As such, this method further may be used as a 
diagnosfic agent in the diagnosis of disease (e.g., cancer). An exemplary embodimeiit of 
studying a cancer humoral response with chromatofocusing and HPLC techniques is 
provided in Example 1 1. 

20 IV. Mass Spectroscopic Analysis and 2-D Display Systems and Methods 

In some preferred embodiments of the present invaition, separated proteins are 
analyzed by mass spectrometry to facilitate the generation of detailed and informative 2- 
D protdn maps. The present invention is not limited by the nature of the mass 
q)ectrometiy technique utilized for such analysis. For example, techniques fliat find use 

25 widi the present invention include, but are not Umited to, ion trap mass spectrometry, ion 
trq)/time-of-flight mass spectrometry, quadnqwle and triple quadrupole mass 
spectrometry, Fourier Transform QCR) mass spectrometry, and magnetic sector mass 
spectrometry. The following description of mass spectroscopic analysis and 2-D protein 
display is illusfrated with ESI oa TOF mass spectrometry. Those skilled in the art will 

30 ^preciate the qipUcabiUty of other mass spectroscopic techniques to such mefliods. 
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In some embodimoits of the present invention, ESI oa TOF mass spectrometry is 
used following two dimensional protein separation to provide an accurate protdn 
separation map. For example, in one embodiments of the present invention, proteins 
were analyzed from human erythroleukemia(HEL) cells. The human erytbroleukemia 

5 PEL) cell line was obtained from the Department ofPediatricsatThe University of 
Michigan. HEL cells were cultured according to the methods described in Example 1. A 
preparative scale Rotofor (Biorad) was used in the first dimension separation. In this 
experiment, 20 mg of protein was loaded. The proteins were separated by isoelectric 
focusing over a 5 hour period with slight modifications to the Rotofor methods described 

10 elsewhere herein. The separation temperature was 10°C. and the separation buffer 
contained 0.5 % n-octyl -D-glucopyranoside (OG) (Sigma), 6 M urea (ICN), 2 M 
thiourea aCN), 2% -mercaptoethanol (Biorad) and 2.5 %Biolyte ampholytes, pH 

3.5-10 (Biorad). 

The procedure used for running the Rotofor (Rotofor Purification System, Biorad) 
15 was a modified version of flie standard procedure described in the manual from Biorad. 
The starting power, voltage and current were 12 W. 400 V and 36 mA respectively. The 
ending power, voltage and current were 12 W, 1000 V and 5 mA respectively. The 20 
fiactions contained in the Rotofor were collected simultaneously into separate vials using 
a vacuum source attached by plastic tubing to an array of 20 needles, which were 

20 punched through a septum. The Rotofor fractions were aliquotted in 400 jiL amounts 
into polypropylene micro-centrifiige tubes and stored at -80»C for ftrther analysis as 
desired. The pH, of the fractions was determined using pH indicator ^aper (Type CF, 
Whatman). Fractions from the Rotofor were quantified using a Bradford assay (See e.g.. 
Wall erci. Anal. Chem., 72:1099 [2000]). 

25 For NPSRPHPLC, separations were performed at a flow rate of 0.4 mL per 

minute on an analytical (3.0 * 33 mm) NPSRPHPLC column containing 1.5 jim C18 
(ODSI) non-porous silica beads (Eichrom Technologies). Tte use of the 3 mm column 
provided more than sufficient sensitivity with the use of the LCT as well as reduced 

sohrent consumption. The colmmi was placed in a column heater (Timberline, Boulder 
30 CO) and maintained at 650C. Tlie separations were performed using water/acetonitrile 
(0.1 % TFA, 0.3% formic add) gradients. The gradient profile used was as follows: 1) 0 
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to 20 % acetonitrile (solvent B) in 1 minutes; 2) 20 to 30 % B in 2 minutes; 3) 30 to 54 % 
B in 8 minutes; 4) 54 to 65% B in 1 minute; 5) 65 to 100 % B in 1 minute; 6) 100 % B in 
3 minutes; 7) 100 to 5 % B in 1 minute. The effective start point of this profile was one 
minute into the ^adient due to a one-minute dwell time. The acetonitrile was 99.93 +% 

5 HPLC grade (Sigma), the TFA was from 1 mL sealed glass ampules (Sigma) and the 
fonnic acid was ACS grade (Sigma). The non-ionic detergent used was n-octyi 
. -D-galactopyranoside (OG) (Sigma). The HPLC instrument used was a Beclanan 
model 127S/166 and tiie peaks were detected on-line by a commercial ESI oa TOF/MS 
(LCT, Micromass, Manchester UX). In prefened embodiments, a detergent is used 

10 throughout the separation and detection steps that is compatible with the steps of RP 
HPLC and ESI oa TOF/MS (eg., detergents of the formula n-octyl (SUGAR) 
pyianoside). 

The ESI oa TOF/MS analyses wore performed on a Micromass LCT equipped 
with a reflectron, a 0.5 meter flight tube and a dual micro-channel plate detector. The 

15 instrument produced proteinmass spectra with a mass resolution of 5000 (FWHM). The 
flow ftom the HPLC column eluent was spUt to the ESI stainless steel capillary at a 1 : 1 
ratio leaving a flow to the mass spectrometer of 0.2 mL/minute. The source temperature 
was held at 150°C. the desolvation temperature was 400°C, the nebulizer gas (N2) was 
left at 50% maximum flow and flie desolvation gas was held at 600 Uminute. The 

20 capillaryvoltagewasheldat+2500 Vandthesampleconevoltagewasheldat+45 V. 
The extraction cone was held at +3 V. The RF voltage was set at 1000 V with the first 
hexapole bang biased to a positive DC offeet of +7 V and the second hexapole being 
biased to a negative DC effect of -2 V. Hie detector voltage was held at 2900 V. Data 
was acquired for a maximum mass/charge range of 5000 resulting in a pusher cycle time 

25 of 90 (is. The data was stored to the ECP at a ratRof 1 Hz and then transferred from this 
data-collecting computer to the main data analysis computer for generation of the data 
files andTIC. 

Software used to analyze the mass spectra was the MaxEnt (version 1) software 
and Mass Lynx version 3,4 (Micromass). Typical deconvolution was performed with a 
30 wide target mass range, 1 Dalton resolution, 0.75 Da peak width and 60% peak height 
values. All deconvoluted mass spectra from a given TIC were added together to produce 



55 



wo 2004/063719 



PCTAJS2004/000774 



one mass spectrum for each TIC. Hie TIC mass spectra from each of the Rotofor 
fractions were thrai input to the 2D m^ing software (available from Dr. Stephen J . 
Pams, Ifaiversity of Midiigan, Department of Chemistiy, 930 N. Itoiversity Ave., Ann 
Aibor, MI 48109-1055). 
5 The 2-D image in Figure 9 shows protein molecular weight in flie vertical 

dimension and protein pi in the horizontal dimension, hidividual proteins are represented 
as bands within the grayscale image. Protein identities were matched to ttiis image by 
overlaying a virtual map of all proteins previously identified via the NPS RP HPLC 
separation mefliod described above and digest analysis wifli MSFit database searching. 
10 The ejqjerimental mass values were typically within 1 to 3 parts per thousand of 

the value recorded in the SWBS-PROT database. The pi could be estimated to within 
0.01 to 0.5 pi units using intensity profiling as described below. Each vertical lane 
represents, in band format, all proteins observed via LCT mass spectral detection from 
flie NPS RP HPLC analysis of Aat particular Rotofor fraction. The MPS RP HPLC 
15 separations were perfomied on &om 17 to 60 \ig of protein per Rotofor fiaction. The 
bands in the im^e vary in gray scale intensity according to the intensity of the source 
molecular weight peaks. This image has been magnified in the intensity dimension by 
allowing virtual saturation of the signal of the more abundant proteins. The 
magnification factor is 27X or 53615/2000 (max intensity/magnification intensity). The 
20 intensity has a linear dynamic range ofat least 3 orders of magnitude. Some of the same 
protein patterns can be seen in both the Uquid phase separation and a 2D gel image fixmi 
Swiss-Prot (htlp://expasy.cbr.nic.ca/ch2dothergifs/publi/elc.gif). Five of the nineteen 
proteins identified in the 2D gel image also were found in the liquid phase separation. 
When comparing tiiese images it must be kept in mind that the mass scale is linear from 
25 flie liquid phase separation and logarithmic in the gel phase separation. 

The pi of proteins isolated in tiie 3D liquid sq>aration method can be estimated by 
• observingtheintensityofagivenproteinpeakoverarangeofplfractions. Asaprotein 
may spread anywhere from 2 to 6 pi fractions due to diffusion and basic cathodic drift, it 
should be most abundant in that, fraction tiiat is closest to its own pL This can be 
30 observed in the zoom image of Figure 10 (See also, zoom image of Figure 13). Using 
fiiis approach, flie pi of alpha-enolase is estimated to be 7.0 (database value of 7.01), and 
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the pi of glyceraldehyde 3-PO4 dehydrogenase is estimated to be 8.0 (database value of 
8.57). This acidic shift may be due to a post-translational modification such as 
phosphoiylation or glycosylation. 

The protein molecular weights were determined by MaxEnt deconvolution of 

5 multiply charged protein umbrella mass spectra that were obtained by combining 

anyvrfiere firom 10 to 60 seconds of data from the initial total ion chromatogram (TIQ. 
The umbrella for heta and gamma actin is shown in Figure 1 1 A, each form of actin being 
labeled with the charge state. Figure 1 IB shows the resulting molecular weight mass 
spectrum for actin where the two forms of actin are separated. Note that the two forms of 

10 actin are clearly resolved firom one another unlike in gel images where the actin spot 
always represents the co-migration of beta and gamma actin. A useful feature of the 
liquid phase method of the present invention is the capability of the high resolution mass 
spectrometry to quantitate which allows the observer to record relative levels of each 
form of a given protein. Consequently, it is contemplated that one cam detenooine the 

1 5 relative abundances of the phosphorylated and non-phosphoiylated forms of a given 
protein. In addition, post-translational modifications such as phosphorylation can be 
found by searching the data for intervals of some integer value times 80 Da. 

Figure 12 shows the traditional peak view format of one of the Rotofor fi-action's 
combined molecular weight mass spectra. All proteins were deconvoluted and then 

20 added together into one mass spectrum. There are 44 unique protein molecular weights 
observed in this mass spectram. Assuming similar numbers of unique masses in all 15 of 
the Rotofor firactions analyzed herein, and accounting for longitudinal diffusion between 
fictions, it is estimated that approximately 220 unique protein masses in the image &om 
a pi of 4. 1 to a pi of 8.75. The Rotofor produces 20 firactions, though only 1 5 were 

25 analyzed in this work, so that around 300 unique masses should be observed in the fiiU 
analysis of all Rotofor firactions. It is contemplated that lower level proteins not obtained 
in the above experiment can be obtained using improved HPLC gradients, 53 mm long 
columns and more detailed MaxEnt analyses. Using such methods, it is contemplated 
that the number of unique masses will be around 750. 

30 As shown in the above experiments, the 2D protem image from the lEF-NPS RP 

HPLC-ESI oa TOF/MS separation of the human erythroleukemia cell lysate provides 
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high mass resolution and high accuracy imaging of the proteins. The mass resolution 
allows the image to show very different forms of the same protein that have small 
differences in mass. With a mass resolution of 5000 Da, a 50000 Da protein can be 
resolved from a 5001 0 Da protein. Clearly, single phosphorylations on entire proteins 

5 can be observed with this level of resolution. Quantitative comparison between 2-D 

images can be achieved by spiking samples with known amounts of standard proteins and 
normalizing images through landmark proteins. Thus, the observer can detect significant 
abundance changes in the protein profiles of different samples. The differences can then 
be targeted for more detailed analysis. For example, protem bands on the image can be 

10 hyp^-linked to other experimental results, obtained via analysis of that band, such as 
peptide mass fingerprints and MSFit search results. Thus all information obtained about 
a given 2-D image, mcluding detailed mass spectra, data analyses and complementary 
experiments (immuno-afBnity, peptide sequencing) can be accessed &om the original 
. image. 

1 5 Having idaitified and characterized the proteins that have changed in abundance 

due to some disease state or drag treatment, it is possible to identify biomarkers for 
disease states as well as drug targets for pharmaceutical agents and monitor the presence 
of; or change in, such markers in a particular biological sample (e.g. , tissue samples with 
and without exposure to a candidate drug). Indeed, drug screening and diagnostic 

20 techniques can be automated using the systems and methods of the present invention, 
wherein cells (e.g., experimental and control cells) are cultured, treated, and lysed using 
it)botics and wherein the lysate is fed into tiie automated separation and analysis systems 
of the present invmtion. 

As is clear from the above description, the methods and systems of the present 

25 invention provide a range of novel features that provide improved methods for analyzing 
protein expression patterns. For example, the present invention provides a combination 
of lEF, resulting in pl-focused proteins in liquid phase fractions, with nonporous RP 
HPLC and ESI oa TOF/MS to produce a 2-dimensional liquid phase protein map image 
analogous to that of a 2-D gel. These methods allow the identification of proteins 

30 separated by lEF-NPS RP HPLC using enzymatic digestions and mass spectrometric 
analysis of the resultibig peptide mass fingerprints and correlation of this data with the pi 
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and molecular of lite protein foirnd via the whole protein 3-D separation metiiod. hi some 
improved display embodiments of the present invention, one can view a collection of 
different lEF-NPS RP HPLC-ESI oa TOF/MS chromatograms in one 2-D image 
displa5rag the mMS spectra in atop view protein band format, not Ihe traditional side 

5 view peak fonnat The methods also allow the detection of proteins and deterannation of 
Iheir molecular weights by analyzing the eluent from the HPLC with computational (e.g., 
on-line) analysis using ESI 6a TOF/MS, 

The ffiP-NPS RP HPLC-ESI oa TOF/MS method also allows one to folly 
integrate and deconvolute each of the TIC's generated to display complete mass spectra 

10 of each collection ofpl-fbcused proteins. The method also allows the display of aU the 
integrated TIC's in one 2-D image where the vertical dimension is in terms of protein 
molecular weight and the horizontal dimension is in terms of protein pL In such displays, 
the protein mass spectra ^ear as bands as they will also be viewed from the top. This 
image would therefore also contain relative quantitative information wherein the bands 

15 vary in intensity depending on the amount of protein present. The use of liquid phase 
separation techniques with the method allows for coUection of protein fractions to micro- 
tubes or 96-well plates such that the proteins could be digested and the peptide mass 
m^s analyzed to determine the identity of said proteins simultaneously. 

20 V. Automated 3D HPLC/MC Methods for Rapid Protein Characterization 

In some embodiments, the present invention provides an automated system for the 
separation and identification of protein samples based on multiple physical properties. 
Accordingly, in some embodiments, the protein separation and analysis techniques 
described in the preceding sections arc automated mtoone integrated, on-line system 
25 Protdn samples are separated in afirst phase and a second orthogonal phase, followed by 
mass spectroscopy analysis. In preferred embodiments, all of tiie steps are automated and 
coordinated through an automated sample handler and a centralized control network. 

Accordingly, in some embodiments, the entire separation and characterization 
process is controUed through one centralized control network. The network is integrated 
30 with all of tiie i^jparatus and software used for the automated process, fri some preferred 
embodiments, the centralized control network includes a computer system. The use of a 
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centralized control network allows for the entire separation and characterization process 
to be controlled from one computer terminal by one operator. The network directs 
sample through the ^propriate separation phases. The network then controls the transfer 
of protein information to analysis software. The analysis software is integrated into the 
5 network and can be programmed to gaierate a customized report based on the 
information required by the user. 

A. Protein Separation 

As described above, flie present invention provides methods for the separation of 
10 protein samples in two phases. In preferred embodiments, the methods are orthogonal, 
and thus allow for the generation of a two-dimensional m^. Jn some preferred 
embodiments, the present invention ftirther provides methods of automating the two 
phase separation. 

15 1, Separation in a First Phase 

The automated separation methods of the present mvention may be used on any 
suitable protem sample. As discussed above, in some embodiments, the sample is 
solubilized in a buffer comprising a compound of the formula n-octyl SUGAR 
pyranoside(e.^„ including, but not limited to, n-octyl -D-glucopyransoside and n-octyl 

20 -D-galactopyransoside). 

The first dimension of the automated separation process separates proteins based 
on a first physical property. For example, in some embodiments of the present invention 
proteins are separated by charge (eg., ion exchange chromatography), hi some preferred 
embodiments, cation exchange chromatography is used to separate positive proteins and 

25 anion exchange chromatography is used to separate negatively charged proteins. 
However, the first dimension may employ any number of separation techniques 
including, but not limited to, ion exclusion, isoelectric focusing, normal/reversed phase 
partition, size exclusion, Ugand exchange, liquid/gel phase isoelectric focusing, and 
adsorption chromatography. 

30 hi some preferred embodiments, the first separation phase is conducted in the 

liquid phase. In some embodiments, the first phase is ion exchange. In such . . 
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embodmraits, it is prefened that samples are de-salted prior to the second separation 
phase. In some embodiments, desalting is performed on an automated solid phase 
ejrtraction (SPE) system. In some embodiments, both the ion exdiange and the desalting 
are performed oh the same automated SPE system, hi other emboduneaits, the ion 
5 exchange is performed on a column and tiie eluate is directed into the automated SPE 
sj^em. 

In some embodimmts, if proteins are present in small amounts, samples can be 
loaded onto the SPE columns multiple tunes in order to obtain a sufficient amount for 
analysis. Thus, the present invention has the added advantage of allowing the 
10 identification ofproteins with a low level of expression. 

2. Automated Sample Handling 

As described in the preceding section, in preferred embodiments, samples are 
processed using an automated sample handling system. The present invention is not 

15 limited to any one automated sanqjle handling systan. However, in some preferred 
embodiments, an on-line automated, SPE system is utilized (e.g., including, but not 
limited to, the Prospekt automated SPE system; Spark Holland Instrumenten, The 
Netherlands). The advantage of on-Une SPE is the direct elution of the extract fiom the 
SPE cartridge into the second phase (e.g., LC system) by the LC mobUe phase. Several 

20 laborious handling steps are thus omitted, making on-line SPE much more efficient and 
providing si^erior analytical results. The superior analytical performance of on-line SPE 
is derived fiom the ehmination of eluate collection, evaporation, reconstitution and 
injection, thus.eliminating several major error sources. In addition, on-line elution 
transfers 100% of the purified analytes from the extraction cartridge into the LC (e.g., 

25 HPLC). This provides maximum precision and sensitivity, as well as reduced costs, thus 
saving solvents, gtessware, and labor time. In addition, samples and SPE cartridges are 
processed in a completely closed system, making sample tracking easy and protecting 
samples against Ught and air. It also protects the operator from contact with ha2ardous 
samples or solvents. Furthermore, less handling means fewer failures and high pressure 

30 solvent control for SPE makes the process independent of cartridge back pressure. 
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3. Separation in a Second Phase - 

In some preferred embodiments, fbUowing the first separation phase, products of 
the separation step are fed directfy into a second Uquid phase separation step. The second 
dimension separates proteins based on a second physical property (i.e., a different 

5 pioperty than the first physical property) and is preferably conducted in the Uqmdph^^^ 
(feg., liquid-phase size exclusion). For exanq»le, is some embodiments of ^ present 
invention, proteins are separated by hydrophobicity using non-porous reversed phase 
HPLC {See e.g., Liang et al.. Rap. Comm. Mass Spec., 10:1219 [1996]; Griffin et al, 
Riq). Comm. Mass Spec., 9:1546 [1995]; Opitedc et al.. Anal. Biochem. 258:344 [1998]; 

10 Kilssonei a/.. R^p. Comm. Mass Spec., 11:610 [1997]; Chen g<fl/.. Rap. ComnL Mass 
Spec., 12:1994 [1998]; Wall et al.. Anal. Chem., 71 :3894 [1999]; Chong et al.. Rap. 
Comm. Mass Spec., 13:1808 [1999]). 

This method provides for exceptionally fast and reproducible high-resolution 
sq)arations of proteins acasrding to their hydrophobicity and molecular weight. The 

15 non-porous (NP) silica packing material used in these reverse phase (RP) sqiarations 
eliminates problems associatedwithporosityandlowrecovery of larger proteins, as weU 
as reducing analysis times by as much as one third. 

hi preferred anbodiments, an automated on-line sample handling system utilized 
in the present invention fijlly integrates the second separationphase with the first 

20 separation step. The sample flows directly &om the first phase {eg., ion exchange) 
through a desalting step (e.g., SPE) to the second phase (e.g., NP-RP HPLQ- ^ 
preferred embodiments (e.g., those utilizing the Prospekt system) the HPLC column is 
integrated into the automated sample handling system. For example, a multi valve 
system can be utilized where valve-switching is used to bring the extraction cartridge into 

25 the HPLC system, hi some embodiments, a sample is passed through the second phase 
separation step (e.g., NP-RP HPLQ greater than one time (e.g., twice) in order to 
improve selectivity and resolution. For example, in some embodhnents, two different 
NP-RP-HPLC columns are utilized in tandem. The automation of protem separation 
increases efficiency and speed as weU as decreases sample loss orpotentid 

30 contamination that may occur Ihrou^ handling. 
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B, Protem Identification by Mass Spectroscopy 
Following separation in the first and second phase, the automated sample 
handling system transfers samples to fte mass spectroscopy step. The present invention 
is not limited to any one mass spectroscopy technique. Indeed, a variety of techniques 

5 are contemplated For example, techniques that find use with the present invention 
include, but are not limited to, ion trap mass spectrometry, ion trap/time-of-flight mass 
spectrometry, quadnqwle and triple quadrtqpole'mass spectrometry, Fourier Transform 
(ICR) mass spectrometry, and magnetic sector mass spectrometry. In preferred 
embodiments, the MS analysis is automated and is performed on-line. In some 

10 embodiments, the eluent &om flie second separation phase is split into two fi-actions. A 
fiaction of the efiQuent is used to determine molecular weight by either MALDI-TOF-MS 
or ESI oa TOF (LCT, NGcromass) (See e.g., U.S. Pat. No. 6,002,127), The remainder of 
flie eluent is used to detramine the identity of the proteins via digestion of the proteins 
and analysis of the peptide mass map fingerprints by either MALDI-TOF-MS or ESI oa 

15 TOF. The molecular weight 2-D protein map is matched to the appropriate digest 

fingerprint by correlating flie molecular weight total ion chromatograms (TIC's) with the 
UV-chromatograms and by calculation of the various delay times involved. The UV- 
chromatograms are automatically labeled wifli the digest fingerprint fi-action number. 
The resulting molecular weight and digest mass fingerprint data can then be used to 

20 search for the protein identity via web-based programs like MSFit (UCSF). 

A detailed discussion of the use of 3-D maps generated by the automated 
separation process of the present invention to identify and characterize proteins is 
provided in the above sections. In some embodiments, the present invention provides a 
3-D m^ in which the first dimension represents a first physical property (e.g., charge or 

25 isoelectric point), the second dimension represents a second physical property {e.g, , 
hydrophobicity or molecular weight), and the third dimension represents the molecular 
weight and relative abundance of proteins present in the sample. In some embodiments, 
the data fiom the 3-D protem map is used to search protein data bases in order to 
detOTnine the identity of the proteins. 

30 hi some ranbodiments of the present invention, sample analysis is automated and 

integrated with the caitralized control network. For example, mass spectroscopy data is 
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transferred to an integrated conq>uter system containing software for the gaieration of 3- 
D protein m^s. The mtegrated conq)Uter system is also capable of searching databases 
and generating a repmt. The report is provided to the operator in a format that is 
customized to the particular appUcation. For example, if an expaiment was designed to 
5 identify unknown components of a solution, the report identifies components of the 3-D 
map as particular proteins. Conversely, if an experiment is designed to compare ttie 
protein expression profiles of two samples, the report may identify proteins that are 
present in one san:q>le and absent in another or are present at different abundances 
between the two samples. 



10 



C. Automated Protein Separation and Characterization in Practice 
Illustrative Example 8 describes one particular embodiment of flie present 
invention where an automated on-line Prospekt system was used to separate a protein 
sample based on charge and hydrophobidty. Siberian Permafi-ost whole celllysate was 

15 first separated using a mini MonoQ anion exchange cohann. A graph of &e Mini Q 

column eluent is shown in Figure 17. Fractions (1 miiiute each) from the anion exchange 
column gradient were fed direcdy into the second step using the automated Prospekt 
system. The Prospekt then trsqjped the factions on 10 C4SPE cartridges. Each cartridge 
was washed wifli the reverse-phase HPLC starting buffer to remove residual salt. The 

20 Prospekt system integrates the HPLC and SPE steps with a mulli valve switching system. 
FoUowing the wash step, the eluent from the SPE cartridge was dixecfly transferred to the 

NP-RP HPLC column. 

The fractions were separated using a tandem column method. A gradient was 

appUed to the HPLC column. The HPLC column was then switched back to the initial 
25 buffer and allowed to equiUbrate. The eluent fiwm the first gradient is then passed 

through a second (different) HPLC column. The use of a second tandem column 

increases resolution and sdectivity. This step is repeated for each of the SPE cartridges 

(each representuig one anion exdiange fraction). 

Following separation by NP-RP-HPLC, protein fractions were analyzed online by 
30 MS to determine then- molecular weight and abundance. The eluent from the column was 

split into two fractions. One fraction is digested enzymatically before MS. Both the 
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digested and non-digested sample were analyzed by ESI oa TOF TIC (total ion count) 
mass spectroscopy. Total ion count profiles are shown in Figures 1 8A and 1 8B. 

EXPERIMENTAL 

5 The following examples serve to illustrate certain preferred embodiments and 

aspects of flie present invention and are not to be constnied as limiting the scope thereof 

EXAMPLE 1 
HEL Cell Sample Preparation 

10 The human eiythroleukemia (HEL) cell line was obtained &om the Department of 

Pediatrics at The University of Michigan. HEL cells were cultured (7% CO2, 37 *'C) in 
RPMI-1640 medium (Gibco) containing 4 mM glutamine, 2 mM pyruvate, 10 % fetal 
bovine serum (Gibco), penicillin (100 units per mL), streptomycin (100 units per mL) 
and 250 mg of hygromycin (Sigma), The HEL cell peDets were washed in sterile PBS, 

15 and then stored at SO^C. The cell pellets were then re-suspended in 0.1% n-octyl B-D- 
galactopyranoside (00) (Sigma) and 8 M urea (Sigma) and vortexed for 2 minutes to 
eflFect cell disnq)tion and protein solubilizatioiL The whole cell protein extract was flien 
diluted to 55 mL with tiie Rotofor buffer and introduced into the Rotofor separation 
chamber (Biorad). 

20 , 

EXAMPLE2 

1-D Gel and SDS PAGE Separation 

HEL cell proteins, resolved by Rotofor separation into discrete pi ranges, were 

ftirther resolved according to their apparent molecular weight by SDS-P AGE. This 

25 procedure takes approximately 14 hours to complete. Samples of rotofor firactions were 
suspended in an equal volume of sample buffer (125 mM Tris (pH 6.8) containing 1% 
SDS, 10% glycerol, 1% dithiothreitol and broniophenbl blue) and boiled for 5 min. They 
were then loaded onto 10% acrylamide gels. The samples were electrophoresed at 40 
volts until the dye front reached the opposite end of the gel. The resolved proteins were 

30 visualized by silver staining. The gels were fixed overnight in 50% ethanol containing 
5% glacial acetic acid, then washed successively (for 2 hours each) in 25% ethanol 
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containing 5% gladal acetic acid, 5% glacial acetic acid, and 1% glacial acetic acid. The 
gels were impregnated witti 0.2% silver nitrate for 25 min. and were developed in 3% 
sodium carbonate containing 0.4% formaldehyde for 1 0 min. Color development was 
terminated by inpregnating the gels wifli 1% glacial acetic acid, after which flie gels 
.5 were digitized. 

EXAMPLES 
2-DPAGE 

In order to prepare protein extracts fiom the HEL cells, the harvested cell pellets 
10 were lysed by addition of three vohimes of solubilization buffer consisting of 8 M urea, 
2% NP-40, 2% carrier ampholytes (pH 3.5 to 10), 2% B-meic^toethanol and 10 mM 
PMSF, a&BC which the buffer containing the cell extracts was transferred mto 
microcentrifuge tubes and stored at 80°C until use. 

Extracts of ttie cultured HEL cells were separated ia two dimensions as previously 
15 described by Cbsa et al (Chen et al. Rap. Comm. Mass Spec. 13:1907 [1999]) with 
some modifications as described below. Subsequent to cellular lysis in solubilization 
buffer, the cell lysates fiom approximately 2.5 X 106 cells were appHed to isoelectric 

focusing gels. Isoelectric focusing was conducted using pH 3.5 to 10 carrier ampholytes 
(Biorad) at 700 V for 16 h,foUowed by 1000 V for an additional 2 hours. Ihe first 

20 dimension tube gel was soaked in a solution of 2 m©toL of ditinoerythritol (DTE) for 10 
minutes, and then soaked in a solution of 20 mg/mL of iodoacetamide (Sigma) for 10 
minutes, both at room temperature. The first-dimension tube gel was loaded onto a 
cassette containing the second dimension gel, after equiHbration in second-dunension 
sample buffer (125 mM TWs (pH 6.8), containing 10% glycerol, 2% SDS, 1% 

25 dilhioerythritol and bromophenol blue). For the second-dimerision separation, an 

acrylamide gradient of 1 1.5% to 14% was used, and the samples were electiophoresed 
until the dye ftont reached the opposite end of the gel. The separated proteins were 
transferred to an hmnobilon-P PVDF membrane. Protein patterns in some gels were 
visualized by silver staining or by Coomassie blue staining, and on Immobilon-P 

30 membranes by Coomassie blue staining of the mranbranes. 
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EXAMPLE 4 
Rotofor Isodectric Focusing 
A preparative scale-Rotofor ^iorad) was used in the first dimension separation. 
This device separated the proteins in liquid phase according to their pi, and is capable of 

5 being loaded with up to a gram ofprotein, with the total buffer volume being 55 mL. 
Alt«natively, for analysis of smaller quantities ofprotein, a mini-Rotofor with a reduced 
volume can be used. These proteins wwe separated by isoelectric focusing ovct a 5 hour 
period where tiie separation ten5)erature was 10''C and the separation buffer contained 
0.1 % n-octjd B-D-galactopyranoside (OG) (Sigma). 8 M urea (ICN), 2 % fi- 

10 mercaptoethanol (Biorad) and 2.5 % Biolyte ampholytes, pH 3.5-10 (Biorad). The 

procedure used for running the Rotofor (Rotofor Purification System, Biorad) was of the 
standard procedure described in flie manual from Biorad as modified herein. The 20 
fiactions contained in tiie Rotofor were collected simultaneously, into separate vials using 
a vacuum source attached by plastic tubing to an array of 20 needles, which were 

15 punched through a septum. The Rotofor fractions were aUquotted into 400 jiL amounts 
in polypropylene microcentrifuge tubes and could be stored at 80»C for fiirther analysis if 
necessary. An advantage of gel methods is the ability to store proteins stably in gels at 4 
•C for further use. The concentration ofprotein in each fraction was determined via the 
Biorad Bradford based protein assay. The pH of the fractions was detemiined using pH 

20 indicator pq)er (Type CF, "Whatman). 

EXAMPLES 
NPRPHPLC 

Separations were perfonned at a flow rate of 1 .0 mL/minute on an analytical (4.6 
25 • l4nim)NPRPHPLCcolumncontainingl.54iniC18(ODSI)non-poroussilicabeads 
(Micra Scientific Inc.). The column was placed in a TimberUne column heater and 
maintained at 65»C. The separations were perfonned using water/acetonitrile (0.1% 
TFA, 0.05 % OG) gradients. The gradient profile used was as follows: 1) 0 to 25% 
acetonitrile (solvent B) in 2 minutes; 2) 25 to 35% B in 2 minutes; 3) 35 to 45% B in 5 
30 minutes; 4) 45 to 65% B in 1 minute; 5) 65 to 100% B in 1 minute; 6) 100% B m 3 

minutes; 7) 100 to 5% B in 1 minute. The start point of this profile was one minute into 
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the gradient due tp a one-minute dwell time. The acetonitrile was 99.93+% HPLC grade 
(Sigma) and the TFA were fix)m 1 mL sealed glass ampules (Sigma). The non-ionic 
detergent used was n-octyl fi-D-galactopyranoside (OG) (Sigma). The HPLC instrument 
used was a Beckman model 127s/166. Peaks were detected by absorbance of radiation at 
5 214 nm in a 15 (iL analytical flow cell. 

Protein standards (Sigma) used as MW protein markers and for correlation of 
retention time, molecular weight and hydrophobicity were bovine serum albumin (66 
kDa), carbonic anhydrase (29 kDa), ovalbumin (45 kDa), lysozyme (14.4 kDa), trypsin 
inhibitor (20 kDa) and -lactalbumin (14.2 kDa). 

10 

EXAMPLE 6 
MALDI-TOF MS of NP RP HPLC Isolated Proteins 
The MALDI-TOF MS analyses were performed on a Perseptive Voyager* 
Biospectrometry Workstation equipped with delayed extraction technology, a one-meter 
15 flight tube and a high cunrent detector. The N2 lasCT provided Ught at 337 nm for laser 
desorption and ionization. MALDI-TOF MS was used to determine masses of peptides 
from protein digests using a modified (described horein) version of the two layer dried 
droplet method of Dai et al (Dai et al. Anal. Chem., 71 :1087 [1999]). The MALDI 
matrix .cyano-4-hydroxy.cinnamicacid( -CHCA) (Sigma Chemical Corp., St Louis, 
20 MO, USA) was prepared in a saturated solution of acetone (1% TFA). This solution was 
diluted 8-fold in the same acetone solution (1% TFA) and then added to the sample 
droplet in a 1 :2 ratio (v:v). The mixed droplet was then allowed to air dry on the MALDI 
plate prior to introduction mto the MALDI TOF instrument for molecular weigjit 
analyses. 

25 The proteins were collected into 1 .5 mL polypropylene micro-tubes containmg 20 

HL of 0.8 % OG in 50 % ethanol. In preparation for enzymatic digestion the acetonitrile 
was removed via speedvac at 45*^0 for 30 minutes. A solution of 200 mM NH4HCO3 
(ICN) / ImM fl-mercaptoethanol was then added in a 1 to 2 ratio to the remaking 
solution in the tubes, resulting in a solution of 50 to 100 mM NH4HCO3 witii a total 

30 volume of approximately 150 jiL. Subsequentiy 0.25 |ig of enzyme was added to this 
solution and then the mixture was vortexed and placed in a 37^0 warm room for 24 
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hours. The enzymes used were either trypsin (Promega, TPCK treated), which cleaves at 
the caiboxy side of the arginine and lysine residues, or Glu-C (Promega), which in 50 - 
100 mM NH4HCO3 solution cleaves at the carboxy side of the glutamic acid residues. 
The digest solutions were typically 100 nL in volume and 30 to 50 jiL of this 
5 solution was desalted and concentrated to a final volume of5|iL using Zip-Tips 
(Millipoi«)with2 |iLC18resinbeds. The purified peptide solution was then used to 
spot onto the MALDI plate for subsequent MALDI-TOF MS analysis. All spectra were 
obtained with 128 averages and internally or externally caKbrated using the PerSeptive 
standard peptide rnixture containing angiotensin I, ACnH(l-17), ACTHO 8-39) 

10 ACTH(7-38) (PerSeptive Biosystems). 

These digests were then used to aid in the identification of the proteins by 
MAIDI-TOF MS analysis and MSFit database searching (Wall et aL, AnaL Chem., 
71 :3894 [1999]). The peptide mass maps were searched against the Swiss and NCBInr 
protein databases using MSFit allowing for 2 missed cleavages. The molecular weight 

15 ranged-from 5 kDa to 70 kDa and the pi ranged over die full pi range. Externally 
cahT)rated peptide masses were searched with 400 ppm mass accuracy and internally 
caKbrated peptide masses were searched with 200 ppm mass accuracy. 

EXAMPLE 7 

2Q Chrbmatofocusmg 

In one exemplary embodiment of the chromatic focusmg techniques of the present 
invention, proteins are extracted fix)m cells using a chemical lysis procedure. The lysis 
buffer consists of SMguanidin^hydrochloride, 20 mMnK)ctyi -D-glucopyranoside and 

50 mM Tris. Cells are vortexed rigorously and Jcept ovemight.at 20 OC. They are 
25 subsequentlyc6ntrifugedatl7,000rpm:for20mift. The supematantis removed firom the 
ceUdebrisandre-centrifugedathighspeedtofurfherremoveanyparticulate. Forthe 
bestrt!producibleresults,lysatfeisbestusedwithin48hrs. Buffers for this CF are (A) 
hnidazole-HAC, 0.1% guamdine-hydrochloride. 0.05% n-octyl -D-glucppyranoside, 
pH 7.2, and (B) Polybuffer 74 (dUuted 1:10), 0.1% guanidme-hydrochloride, 0.05% n- 
30 octyl -D-glucopyranoside, pH 4. Ihe CF column in this example is Mono P HR 5/20 
(AmershamPhannada. Uppsala. Sweden) wilhaflowrate of ImUmin at room 
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temperature. Prior to injection lysate is equilibrated wifli buffer A with a loading time of 
20 min. The sample loadability for this CF column is 10 mg of protein. The separation 
profile is monitored at 280 ma while the pH gradient is monitored using a pH flowceU 
meter, also from Amersham Pharmacia. 

5 The CF cohmm is equilibrated with buffer A to define the upper pH range (7 in 

this case) of the pH gradient The second focusing buffer B is then appHed to elute bound 
proteins, in the order of their isoelectric (pi) points. The pH of buffer B is 4, which 
defines the lower limit of the pH gradient The pH gradient is formed as the elating 
buffra- B titrates the buffering groups on the ion-«cchanger. 

10 Hie pl-focused liquid fractions fmm CF are analyzed in the second dimension 

using NP-RP-HPLC. Non-porous RP-HPLG columns (Eichrom Technologies, Darien, 
IL, USA) are used as the second orthogonal separation dimension after CF in order to 
obtain a 2-D protein map that is capable of competing with 2-D gel. These columns are 
exceUent for protein separation due to their high protein recovery, speed and efficiency. 

15 To achieve optimal protein separation, the columns should be kept at a high temperature 
(e.g.,60OC). This elevated temperature also inq)roves selectivity. Selectivity as well as 
resolution can also be enhanced by using multiple NP columns in series. RP-HPLC 
columns packed with non-porous siUca beads (Eichrom Technologies) such as ODSl, 2 
and 3 are all well suited for these tasks. 

20 Proteins that elute fromNP-RP-HPLC separation can be direcfly analyzed by MS 

to determine their molecular wdght, identity and relative abundance. In this case the 
eluted proteins are sized simultaneously by ESI-oaTOF MS (LCT, Micromass, 
Manchester, UK). Hie o&er part of the eluted protems bom the spUt valve can be 
collected using a fraction collector for enzymatic digestion to xibtain peptide maps with a 
25 MALDI-TOF MS, ESI-QIT-reTOF MS, or ESI-oaTOF MS (W^T). Information such as 
file molecular weight, and peptide map of aprotein can then be entered into a web- 
based protein database program such as MS-Fit (e.g., http://prospector.ucsf.edu) for 
protein identification. 

2Q Examples 

Automated 3-D IE NP-RP-HPLC-ESI-oa TOF MS 
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This exairple describes an automated system for protein separation and 
identification based on charge, hydrophobicity, and mass. Protein samples are separated 
based on charge using an ion exchange (EE) column. Protein firactions are then tr^ed 
on a soUd phase extraction (SPE) column for desalting using an automated Prospekt 
5 system. The Piospedrt system then directs the protein firactions to a noiq)orous-rev*se 
' phaseHPLC column (NP-RP-HPLC). The samples are then identified using ESI oa TOP 
mass spectroscopy. 

A. Protein Separation and Trapping by SPE 

10 Siberian Pennafiost whole ceU lysate of sample 23-9-25 (obtained from Jim 

Tiendje, Department of Microbial Ecology, Michigan State University) was lysed using a 
chemical lysis procedure. The lysis buffer contained 6M guanidine-HCL, 20 mM n-octyl 

-D-glucopyransoside and 50 mM Tris. The cells were vortexed vigorously and stored 
overnight at O'C. The cells were then centrifiiged at 17,000 tpm for 20 minutes. The 

15 supernatant was removed &om the cellular material and then mixed 1 :1 with an 
equilibration buffer form (10 mMKH2P04, 5% MeOH, 0.1% n-octyl 

glucopyrandside,pH8). The sample was then injected into a Mini Q anion exchange 
cotamn (Amersham Pharmacia. Uppsala, Sweden) with a flow rate of 1 mlAnin at ITC. 
EquiUbration buffer was run through the column for 3 ininutes, followed by a 0% to 
20 100% gradient ofbtifferB (10 mMKH2P04,5%MeOH, 0.1% n-octyl -D 

glncopyranoside. IM NaCl, pH 7) in 15 minutes, A grsqjh of tiie Mini Q column eluent is 
shown in Figure 17, 

Fractions (1 minute each) are each collected on a separate soUd phase extraction 
(SPE) cartridge by directing the eluent firan«ie IE through lO C4:SPE cartridges, A 
25 Prospekt on-line automated SPE system (Spark Holland Jtastrumenten, The Neflierlands) 
was utilized for the SPE, HPLC, and MS phases, 

B. Protein Purification and Separation by NP-BP-HPLC 
The initial mobile phase buffer for the RP analysis was 5 % buffer B (0,1% TFA 
30 in ACN) in buffer A (0,1 % TFA in H2O). IWs solution was directed through the SPE 
cartridge until all the residual salt fixnn the anion exchange mobfie phase was removed. 
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The eluent fiom the SPE cartridge was next directed by the Prospdct system directly to a 
HPLC for the second orthogonal separation phase. 

Non Porous-RP colirams (Eichrom Technologies, Darien, IL) were used as flie 
second separation phase. A tandem colunm method was employed. ODSEIE and ODSI 

5 NP RP HPLC columns (Eichrom Technologies, Darien, IL) contained 1 .5 jim C18 
(ODSI) non-porous silica beads. Column dimensions were 4.6 * 33 mm (ODSIDE) and 
4.6 * 14 mm (ODSI). The columns were maintained at eO'C to improve selectivity. A 
flow rate of 0.5 mL/min at a pressure of 5000 psi was maintained. The columns were 
loaded, equilibrated in the initial buffer, and the gradient was started. A gradient of 

10 buffers (0.1% TFA in ACN) was performed as follows: 5% B for 1.5 min, 5% B to 20% 
B in 2 jnin, 20% B to 35% B in 5 min, 35% B to 60% B in 15 min, 60% B to 100% B in 
5 minutes. The eluent from the first HPLC column (ODSI)-was directed into the second 
HPLC column (ODSniE). 

Following the gradient, the initial mobile phase buffer was run through ttie RP 

15 cohram until a stable baseline is realized. The HPLC step was repeated for each of the 
SPE cohnnns (eadi of which contained a 1 minute fiaction fi;om the anion exchange 
column). 

C. Protein Identification by Mass Spectroscopy 

20 Following separation by NP-RP-HPLC, protein fractions were analyzed online by 

MS to determine their molecular wei^t and abundance. Samples were analyzed by ESI 
oa TOF TIC (total ion count) mass spectroscopy. Mass spectroscopy conditions were as 
follows: capillaiy 2900V, sample cone 45V, extraction cone 3V, RF lens lOOOV, 
desohxtion temp or 350°C, and source tenqi of 120°C. 

25 Results of tiie ESI oa TOF TIC analysis are shown in Figures 18A and B. Figure 

18A shows the total ion profile of the fraction coUected from 3 to 4 of the MiniQ column; 
figure 18B shows the total ion profile of the fraction collected fiiom 7 to 8 minutes. 

Example 9 

30 Automated Protein Separation and CE-MS/MS Analysis 



72 



wo 2004/063719 PCT/US2004/000774 

This method identifies individual proteins in a mixture. They are initially 
physically separated ftom each other and flirai each is digested into a set of peptides. 
Those peptide mixtures are each analyzed by mass spectrometry techniques to obtain the 
protein identification. The physical separation process is a sequence of two methods (two 
5 dimensions). The first is separation by protein pi (pH). The second is a chromatographic 
separation. 

Several mettiods are available to accomplish the separation by pi. The 
commercial Rotofor system is an all liquid phase method which uses an electric field with 
carrier ampholyte to build up a pH gradient. The commercial IsoPrime system uses 
10 membranes to build up the pH gradient. Another approach is to utilize carrier ampholye 
to establish a pH gradient on the stationary phase of a chromatography column and then . 
separate the proteins by elution off the column with a polybuffer. While separation of 
proteins does occur in these methods, there are still many proteins within a given pi 
range. In some embodiments, they are then separated fiom each other by another method 
15 (the second dimension). Proteins are coUected from first two pi separation methods by 
collecting tiie regions that physically exist. 

In the latter chromatography method, the proteins are collected into individual 
tubes as they elute off the column. It is desirable to coUect a small pH range (typically 
0.2 pH units) into eadi tube. pH is continuously monitored using a pH electrode placed 
20 at the output of the column. The electrode's voltage is digitized and measured by custom 
software developed to monitor and record the pH and control the tube changes based on 
user specified pH ranges. A custom tube holder and mechanical changing system has 
been developed. Up to 100 tubes with tops can be held on a one square foot plate. The 
chromatography column output tubing is held above the tubes and its eluent drips down 
25 into a specific tube. A stepper motor attached to tiie platctube holder indexes the desired 
tube under the column output upon commands from the pH monitoring software. 
Alternatively, the software sends control signals to commercial fraction collectors. This 
provides an automated and reproducible first dimension collection scheme. 

Each small pH range wiU stUl contain many proteins. They are further separated 
30 in the second dimension using chromatography methods. As they elute ofFthe column, 
their presence is detected by ultraviolet light absorption. The separated proteins are 
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collected into individual tubes using the^ame custom designed collector described above 
for pH elution or using a commercial fraction collector. The custom pH change software 
also monitors and records the UV absorbance spectra. When the user visually determines 
that a protein is eluting (typically by observing the emergence of a peak in the UV 
5 spectra), they use the software to send a signal to change collection tubes. The software 
records the UV spectra and simultaneously an indication of into which tube the protein 
was collected. Methods are available for automatic peak detection and hence automation 
of this second dimension separation and collection. 

In some cases (iu order to compare protein mixtures from two cell lines for 

1 0 example), the first dimension separation by pH may be collected not at fixed pH 
increment ranges, but rather based on UV absorption of the eluting proteins. The 
fictions are collected on the basis of the presence of proteins ia the eluent instead of at 
equally spaced pH changes. The pH range of each collected fraction is recorded by the 
custom software. The same ranges are then chosen as fraction collector trigger points for 

15 another protein mixture first dimension separation by pH gradient. The pH is monitored 
and when it exceeds the next range collected in the previous separation, the fraction 
collector is triggered to change tubes. Thus the second sample's fractionation occurs 
with pH ranges identical to the first sample. 

The first dimension separation does not have to result in collection into individual 

20 tubes and subsequent injection of those individual fractions onto another colunm for 
second dimension separation. In some embodiments, the two separations are directly 
coupled in an automated process. A defined portion or pH range of the eluent fix)m the 
first column is held up or loaded onto a blocking colunm. When the desired pH range has 
been loaded, computer controlled valves are used to switch the first dimension pH 

25 separation eluent onto another blockmg colunm to begin loading the next pH range. The 
first blocking column is switched to a buffer that elutes its collected proteins onto a 
column used for second dimension separation. Thus by usmg several columns and 
appropriate valve switching, the entire two-dimensional physical separation process is 
automated. 

30 The individual proteins now mostly physically separated from each other are 

analyzed by mass spectral methods to determine their identification* Each final protein 
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fraction is directed to an LCT commercial mass spectrometry system to obtain its intact 
molecular weight This will not be able to indicate however if the protein has been 
modified 

The protein is digested by enzymatic cleavage into a mixture of small peptides. 
5 This mixture is then analyzed by MALDI mass spectrometry to obtain a peptide mass 
fingerprint No fragmentation occurs, Wh«i combined with the intact molecular weight 
information from the LCT data, accurate protein identification can be made with high 
confidence. 

Alternatively, in preferred embodiments, the peptide digest is separated by . 

10 capillary electrophoresis and each peptide identified by on-line by mass spectrometry 
(CE/MS). Each peptide is fiagmented and its amino acid sequence detemMned. This 
leads to very accurate identification of the protein with high confidence. 

There will be 1 5 fi^ctions if separated into a range of 0.2 pH each over a total of 3 
pH units in the first dimension. With 75 proteins in each fraction there are 1 125 proteins 

15 to be digested and analyzed by CE/MS . Custom software has beai developed to 

automate the CE/MS process. It controls a sample injector, high voltage power supply 
for CE sample loading and running, CE column/electrode mechanical positioning for 
loading and rurming and overall sj^tem timing. Data from the time of flight mass 
spectrometer is digitized and recorded by computer with the custom software. The 

20 software provides representation of the data as a two dimensional image for visual 
identification of important mass spectral peak locations. 

In some embodiments, CE/MS/MS is performed. The desired ion of interest is 
first isolated and thm fragmented. The ions in the eluting peak are confined to an ion 
trap and a waveform consisting of all frequencies except for the frequency resonance 

25 with the desired ion is applied to the.cell. This notched waveform excludes all 

interferences from the ion trap. Then, a low amplitude frequency that is resonant with the 
desired ion is applied to partially fragment it. When subsequently ejected from the ion 
trap, both parent and fragment ion information is obtained free of background. In order 
to know the appropriate frequency to apply to the ion trap, the mass-to-charge ratio of the 

30 desired ion must be known. Custom software has been developed to determine this in 
real time as a peak begins to elute. The notched waveform is immediately calculated and 
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applied to the ion trap so that background-free spectra can be acquired over as much of 
the peak as possible. This capability avoids the necessity of performing two injections 
witti the first done to obtain the desired ion-of-interest infomation. This more than 
doubles the throughput. The amphtude of the frequency resonant with the desired ion-of- 
5 interest is feedback adjusted in real time from spectra to spectra to optimize the parent to 
fragment ion ratio. 

Example 10 
CE-MS/MS Analysis of CaldCLl 
10 This Example describes the analysis of CaldCLl, a fiiUy malignant breast cancer 

cell line. Cells were separated by rotofor and chromatography, followed by tryptic 
digestion for CE-MS/MS analysis through database searching. The high efficiency of 
this method was demonstrated and discussed based on increased peptide coverage yield 
compared to other mass spectrometric methods. 

15 

A. Mefliods 

Figure 21 shows an overview of the method utilized. A Mini-Rotofor (17 mL 
edacity) in pH range of 4 to 9 was utilized for tiie first separation phase. The 
ampholytes used were 1 mL of 40% Bio-Lyte 3-10. The rotofor was run at 12 W for 3.5 

20 hrs. A MICRA-Platinum nonporous reversed phase CI 8 column (1 .5 pm, 33 x 4.6 mm) 
from Eichrom was nsxX used to separate the mixture in a second phase. 
The HPLC fractions were next digested and processed for CE-MS/MS. Briefly, a protein 
to enzyme ratio of approximately 1 :30 to 1 :40 was used. 4 pL of 0.5 jig/mL trypsin was 
added with 40 \xL of 50 mM NaHCO 3 digestionbuffer at pH 7.8. The digestion mixture 

25 was incubated at 3TC for approximately 15.hrs. The digested sample was flien dried 
down by speed vacuuming and reconstituted with 1 ml of DI water and stored at -20°C 
until analysis. 

For CE analysis, a capillary column with dimensions of 50 cm long, 150 |im 
OD/50 \xia ID was utilized. The running buffer solution was 50mM ammonium formate 
30 at pH 2.7. An electrokinetic injection at 1 to 3 kV for 10 to 60 sec, followed by a voltage 
gradient of approximately 240 V/cm during the run was used. A positively charged inner 
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wall wifli multiple coating procedure was utilized (See Figure 22). A capillary tip coated 
witii silver for sheathless ESI was ako utilized. The multiple coating of flie capillary 
column was performed as follows. Each step was performed under N2 (g), followed by 
DI water rinse for 5 min. The steps were: 
5 1. IMNaOH rinse for 30 min 

2. 10% polybrene (w/v) / 3% ethylene glycol (v/v) rinse for 30 min 

3. 6% dextran sulfate rinse for 30 min 

4. 10% polybrene (w/v) / 3% ethylene glycol (v/v) rinse for 30 min 

Figure 23 shows a flow chart of the CE-MS/MS instrmnentation and procedures utilized. 

10 

B. • Results 

Figure 24 shows a representative chromatogram of an RP HPLC run. Figure 25 . 
shows a representative 3D profile of a CE-MS run. Figure 26 shows a CE-MS elution 
profile of a tryptic digest. Figure 27 shows a MS/MS spectrum of a tryptic digest of heat 
15 shock protein. Figure 28 shows a table of theoretical and experimental pPs and MW of 
proteins identified. Figure 29 shows a comparison of coverage between different MS 
methods. 

In conclusion, several proteins in malignant breast cancer cell line were 
successfiilly identified by an on-line cq)illary electrophoresis - tandem mass spectrometer 

20 method with higiher peptide coverage than other mass spectrometry methods. The 

capillary column generated by multiple ionic coating procedure demonstrated increased 
stability. The on-line cq)illaiy electrophoresis-tandem mass spectrometry for analysis of 
proteome methods described resulted in the collection of 2-D Data of total ion count and 
MW during electrophoretic separation using MS/MS with SWIFT technology. The 

25 tryptic digest of cancer cell line proteins for identification by database searching resulted 

in increased covcarage. 

Example 11 

Microarray of Cancer Proteins Produced UtOizing 
Chromatofocusing and HPLC Techniques 
30 In one exemplary raibodiment of the present invention, a microarray of cancer 

proteins is produced utilizing chromatofocusing and HPLC techniques. 
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In order to develop a microairay of cancer proteins, prostate cancer cell line, 
LnCAP, was used. LnCAP cells were cultured in RPMI medium supplemented in 1 00 
mm tissue culture dishes (Becton Dickinson, Franklin Lakes, NJ). Log phase cells were 
harvested, washed in PBS and pelleted. The LnCAP cell pellets were lysed and used for 
5 fractionation. Cell pellets were reconstituted in lysis buffer. The lysis buffer consisted of 
7M urea, 2M thiourea, lOOmM DTT, 0.5% biolyte 3-10, 2% octyl glucoside and ImM 

PMSF. The cell pellets were lysed at room temperature for half an hour, followed by 
centrifixgation at 35,000 ipm at 4 degrees C for 1 hour. The supernatant was stored at 80 
degrees C for future use. A PD 1 0 column, equilibrated with 25iiiM bis-tris in 6M urea 
10 and 0.2% octyl glucoside, was used to exchage the cell lysate from the lysis buffer to the 
above buffer. 

LnCAP cell lysates were next fractionated usiiig a pi gradient by 
xhromatofocusing at 0.2 pi mtervals. Two buffers, a start buflfer and an elution buffer, 
were employed. The start buffer was 25mM bis-tris with pH 7.1, and tixe elution buffer 

15 was polybuffer74:water in a ratio of 1 : 10 with pH 4.0. Both buffers were prq)ared in 6M 
urea and 0.2% octyl glucoside. Iminodiacetic acid was used to adjust the pH of both 
buffers. The PS-HPCF ID column was equilibrated with the start buffer until the pH of 
the effluent was 7.1. Sample was applied to tiie column with multiple injections. Once a 
stable baseline was acMeved, the elution buffer was switched on to elute the protei^^ 

20 the column in an isocratic mode, UV detection was performed at 280 rail and the pH of 
the effluent was monitored using a flow-through online pH probe. The pH fractions were 
collected in 0.2 pH intervals and 1 5 fractions in total were collected in the range of pH 7- 
4. The CP separation was stopped when ttie pH of the effluent reached 4. 

Each of these pi fractions were collected and separated in the second dimension 

25 by NPS RP-HPLC. RP-HPLC was performed using PS-HPCF 2D <4.6 X 33mm) 

columns. Solvent A was 0.1% TFA in water and Solvent B 0.1% TFA in acetonitrile. The 
gradient was run from 5% to 15% in Imin, 15% B to 25% in 2 min, 25% to 31% in 2 
min, 31% to 41% in 10 min, 41% to 47% in 6 min, 47% to 67% in 4 min, finally up to 
1 00% B in 3 min and held for another 1 min, then back to 5% in 1 min at a flow rate of 

30 Iml/min; The column temperature was 40° C higher than the ambimt ten5>erature, UV 
absorptions wk« monitored at 214 nm. RP fractions were collected using a fraction 
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collector. The fractions were dried down completely by Speed-Vac at 75" C and stored at 
- 80*^ C until further use. 

A profile of the total fractionation from pH 4-7 is shown in Figure 31. Some of ' 
the fractions obtained after chromatofocusing contained as many as 70-90 proteins. As a 

5 representative example the different proteins separated by NPS RP-HPLC from the pi 
fraction 5 are shown in Figure 32, It is estimated that the sample was fractionated into as 
many as 1400 protein bands. Many of these bands contain single proteins, but some 
contain more lhan one protein. The presence of one or more proteins coiild be confirmed 
by MW measurements using ESI-TOF MS. Nevertheless, this method provides cellular 

10 proteins with a relatively high degree of purity, which can be used for microanay 
experiments. The relative amount of protein collected in each fraction from the CF 
column and loaded onto the HPLC separation, as detennined by the Bradford Protein 
assay, is shown in Figure 33. 

More proteins are usually recovered from the mid-pH fractions than towards the 

1 5 end of the pH range as expected. In these experiments using analytical columns, 
generally up to 5 mg of protein could be loaded on the first dimension without 
overloading the column and losing significant resolution. The total recovery of protein 
from the first dimension has been found to be around 5 1% over the range of pH 4-7. The 
latter can be increased by extending the pH range to 8.5. In addition, some acidic proteins 

20 (with pi below 4) and highly hydrophobic proteins might be lost during the fractionation. 
It is contemplated that an additional salt wash and isopropanol wash provides a means to 
recover these proteins. Nevertheless, on average it is expected <hat around ISOjig total 
protein is recovered from each pi fraction. The proteins from each pi fraction are fiirther 
separated by NPS RP-HPLC, into about 70-80 fractions with high degree of purity. The 

25 recovery of proteins in each fraction by NFS KP-HPLG depends on the protems but on 
average is around 80 %. The result is that on average around 1.5 }ig of protein is 
collected from each protein band. This corresponds to approximately 12 pg of protein or 
0,05 femtomole for a 30 kDa protein. The sensitivity of the anays can be inweased ten 
fold by using a more concentrated protein solution as baits. Importantly, this method 

30 could be adapted to preparative scale such that large amounts of material would be 
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available for constmctmg many.protdn arrays. In Has present work approximately 100 
micioairays were spotted from each collection. 

The targeted UV peak in the 2"* dimensional RP-HPLC chromatogram was 
collected and aliquoted into 3 fractions. One portion was dried down con^letely and used 
5 for microairay experiments. A second portion was dried down completely and 
reconstituted in 50% ACN with 2% fonnic add. The resulting solution was directly 
infused into an ESI-TOFMS (LCI, Micromass, foe.) at lOulAmn to detemune mtact 
molecular weight values of the proteins in the fraction. The other portion of the liquid 
band was dried down to eliminate ACN and TFA. IMNH4HCO3 and 10 mM DTT were 
10 then added to a final concentration of lOOmM and ImM, respectively. A sequence- 
specific endopeptidase such as trypsin was then mixed with Ihe. denatured proteins at the 
' ratio of 1 :50. The mixture was mcubated at 37 °C for 24 hours. MALDI-TOFMS 
(Micromass, Inc., T0FSpec2E) was subsequently employed to analyze peptide mass 
fingerprints (PMF), which were Ihen used to search for Ihe registered peptide masses of 
15 proteins in the database. The intact molecular weights from LCT and the PMF obtained 
using MALDI-TOFMS were together used to identify the protein in Ihe database. Further 
confiimation of the protein identity was obtained by MS/MS sequence analysis using the 
MAIX)I QTOF MS (Micromass, Inc.). 

The fractionated proteins were resuspended in 25 \i\ buffer containing 125 mM 
20 Tris-Cl (pH= 6.8), 4 % SDS and 2 % B-meicaptoethanol (Sigma, St Louis, USA). This 
was the minimum volume required for spotting in the initial work. The samples were 
denatured by heating at 70 »C for 2h followed by a final denaturation at 100 °C for 10 
min in a thermal cycler (Peridn Ehner Life Sciences, Boston, MA). The samples were 
transferred to a 96 well microtitre plate (MJ Research, Waltham, MA) and printed on 
25 nitrocellulose slides (Schleicher & Schuell, Keene, NH) using a GMS 417 microarray 
printer (Genetic MicroSystems, Wobum, MA), fo later work the spotting was performed 
using Magna Spotter microairay printer (Bioautomation). Using this system the spotting 
volume of the protein was reduced to 2.5 uL, allowing the ability to use concentrated 
fractions as baits. Each spot measured approximately 300 urn in diameter and was placed 
30 1200 vm apsA. The plates used to print are 96 weU V bottom plates from Major 
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Sdentifio. The slides were eiliier used iimnediately or stored at 4 'C. They were dried for 
i h at room temperature before farther processing. 

As a representative example, a slide-containing proteins fiactionated in the pH 
range of 4-7 were probed with either serum from healthy individuals or from ssx& from 
5 prostate cancer patients. Figure 34 shows one such representative example of the array 
containing 96 fractions separated in the pH range of 6-6.2. linmunoreactivitywas 
observed with specific fractions when probed with prostate cancer seram (Figure 34 A, 
arrow) which was not seen in senmi from healthy subjects (Figure 34 B). This indicated 
thepresenceof auto-antibodiesinserumfiomprostatecancerpatients against specific 

10 cancar associated proteins. 

The humoral response was next investigated. The humoral response generated 
againstspecificcancerproteinsusingalargerpoolofserumsamples. NitroceUuose 
sHdes containing spotted proteins were blocked in TBS containing 1% non fat milk and 5 
% normal donkey serum (Sigma, St Louis, USA) in 0.1% Tween-20 (Sigma, St Louis. 
15 USA), for 1 h at RT. Hie sUdes were rinsed with TBS and incubated with either serum 
from prostate cancer patients or normal individuals (1 :50) in a microcaster chamber 
(Schleicher & Schuell, Keene, NH) for 2 h at RT. They were washed six times with TBS- 
T (TBS containing 0.1% Tween-20), each for 5 minutes. The slides were then incubated 
with biotin-conjugated anti-human IgG (1:200, Jackson hnmunoResearch Laboratories, 
20 West Grove, PA) for 30 min at RT. They were then washed with TBS-T and incubated 
with Cy5-conjugated Streptavidin (1:250, Jackson hnmunoResearch Laboratories, West 
Grove, PA) for 30 min. The sUdes were finally washed and dried as described above and 
analyzed using a micioarray scanner (Axon Instruments Inc., Foster City, CA, USA). 

Figure 35 "shows the profile of auto-antibodies against fracti<m B6, firactionated at 
25 pH 6.8-7. A specific repertoire of antT)odie8 against fi^on B6 was found in serum from 
prostate cancer patients while negUgible immunorcactivity was observed using serum 
from normal individuals (Figure 35 A). Quantitation of fee data revealed 8 out of a total 
19 PCA (prostate cancer) sera reacted with firaction B6 giving a sensitivity of 42% while 
only 1 out of the 15 normals showed reactivity, giving a specificity of 93 %. Such high 
30 degree of specificity makes fraction B6 ausefid candidate to identify prostate cancer 
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. samples. This is especially so since prostate specific antigen (PSA), the routine marker 
used in the clinic shows a specificity of only 30-35 %. Ttds imphes that there is a high 
degree of felse positivily (60-65 %) associated with PSA. As a result of such low 
specificity, PSA is used only as a primary screen to detect PSA. Thus in this context. 
5 fiactionB6 provides amaricerformonitoringthe occurrence of PCA using serum 

samples to sq)plement PSA levels {see, e.g.. U.S. Patent App. Ser. No. 10/210.120 herein 
incorporated byieferenceinits entirety, for descriptions of cancer marker analysis). The 
low percentage of sensitivity observed wifli j&actionB6 becomes less of an issue since 
PCA is wide spread in the general population. 

1 0 The sensitivity and specificity of monitoring PCA using serum markers could be 

increasedbymultiplexingdata.ftommultiplebiomarkers.Withthisaim,additionalspote 

were investigated for their immunoreactive profile using.24 £enim samples, consisting of 
12 normals and 12 cancers (Figs 36-37). Proteins in these spots were identified using 
MALDI-TOF and their molecular weights were determined using the LCT-MS. Thus 
15 Figure 36 shows the profile of auto-antibodies against mitochondrial creatine kinase 
which was fractionated fiom LnCAP cells at pH 6-6.2. Based on the profile of 
immunoreactivity shown byPCA andnormalsamples a threshold value of 900 

normalized intensity units was used above wMch the serum was considered to show 
specific immune response. When such a criteria was used, 10 out the 12 PCA samples 
20 displayed a specific repertoire of auto-antibodies against mitochondrial creatine kinase. 
In contrast, only 2 out of 1 0 nonnals showed any reactivity. Thus the sensitivity and 
specificity for mitochondrial creatine kinase was 83%. Interestingly, mitochondrial 

creatine kinase has been associated with adenocarcinoma and its detection in sera has 
been correlated to poor outcome of the cancer. In this regard thepresent results showing 
25 the presence of auto-antibody reportoire.could be .due to over«q)Mi8sion of this protein in 
PCA. Another interesting proton which showed presence of auto-antibody repertoire was 

the 54 kDa non-POU domain containing nuclear RNA-DNA binding protein. This was 
also fractionated fiom LnCAP cells at pH 6-6.2 (Figure 37). Among the 12 PCA sera 

tested 10 of the serum showed specific immune response. Thus, using a novel me&od of 
30 fractionating proteins fiom cancer cells and protein microarrays. this shidy identified 3 
novel proteins that elicited a specific antibody response in PCA. Interestingly, the 
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Specificity of immune response to each of these three proteins showed high degree of 
specificity (58-93 %). This was much higher than the specificity exhibited by PSA (30-35 

An important issue in these experiments is the ability to identify the protein spots 
5 that show a humoral response. Identification is performed using a peptide mapping and 
MS/MS and MW measurements. ESI-TOF MS is used to measure the MW of the intact 
protein directly fiom the vial torn which ttie protein was spotted. The. sample can then be 
digested by trypsin and analyzed by MALDI-MS and MALDI-MS/MS for pq)tide 
ms^ping and database searching. The combination of intact MW and peptide mqjping 
10 and MS/MS generally provided a reliable match against the database. The use of peptide 
mapping alone often provided an unreliable identification with a relatively low MOWSE 
score. The MS/MS of several peptides using the MALDI QTOF MS usually provided the 
additional information needed to unanabiguously identify the protein. 

The use of CF fractionation provides a number of distinct advantages in these 
15 protein array experiments. The method results in separation of large numbers of purified 
proteins in the liquid phase. In this initial work over a pH range of 4-7 nearly 1400 
protein bands were observed by UV detection and collected. One could extend the range 
by changing the buffers so that additional proteins could be collected over a pH range up 
to 9.0. Further, the CF method has provided fractionation in intervals ofO.2 pi, where 
20 there is little overlap of proteins between adjacent pi fiactions. This contrasts to previous 
work using the Rotofor as a means of liquid phase pi fractionation, where pi lanes are 
-0.5 pi and there is often overlq) of proteins between adjacent pi fractions. The CF 
method allows fractionation ofvp to 0.05 pi units, thus enabling additional separation of 
overlapping proteins. The use of ESI-TOB MS has shown that some of the protein bands 
25 may consist of more than one protein so that an additional stage of firactionation or a 
narrower pl range could be used to achieve further isolation of proteins if required. 

An advantage of the CF method is that using analytical columns, it can provide a 
substantial amount of material for each protein band where between 3-5 mg of sample 
can be loaded on the column. This may result in as much as 1 jig of protein on average in 
30 each band so that significant amounts of protein may be spotted in each array spot. This 
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waibeimportantrnthe sensitivity for detection, especiaUy for proteins expressed at 
lower levels where sensitivity wiU be at a piemium. Also, the high reproducibiUty of the 
method allows, for additional experiments to be run where the same protein can be 
recollected for further analysis of the spot by mass spectroscopy and a variety of 
5 traditionalmethods-AltemativelylheCFexperimentcouldberunusingpreparative 

columns, however, this is at the expense of resolution, where it was found that it is 
preferable to run the analytical experiment several times to coUect additional materi^^ 

In addition, the use of 2-0 hquid phase separation of protems has several other distinct 
advantages for these microairay experiments. The CF fiactionation can be run in an hour 
10 whileeachofthel5ftactionscaneachbeseparatedbyNPSRP.HPLCinaround20 
minutes. The entire fiactionatioii can be performed in ~ 6 hours. M addition, the entire 
process can be automated so that large numbers of different ceU lines can be fractionated 
for spotting on an array, hi addition, the CF fiactionation provides protdns as expressed 

by the cells. Analysis of these proteins by mass spectrometry, shows that many of the 
1 5 proteins do not have MW values that correspond to that of the database and are modified. 
These modifications as expressed by the cell are often important in the ceU cycle and the 
onset of cancer. Modified isoforms of ^otem have been shown to have a very specific 
humoral response to patient plasma usmg 2-D gel-based studies. 

20 AU pubUcations and patents mentioned in the above specification are herem 

incorporated by reference. Various modifications and variations of the descnTjed method 

and system of flie invention will be apparent to those skilled in the art without departing 
from the scope and spirit of the invention. Althougji the invention has been described in 
connection witti specific preferred embodiments, it should be understood fliat the 
25 inventionasclaimedshouldnotbeundulylimitedto.8uchspecificembodimente^ hideed, 
various modifications of the described modes for carrying out the invention which are 
obvious to fiiose skilled in the art are intended to be within the scope of the following 
claims. 
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CLAIMS 

We Claim: 

5 1 . A method for generating protein microairays, comprising: 

a) providing: 

i) at least one sample comprising a plurality of polypeptides; 

ii) a first protein separation apparatus comprising a 
chromatofbcusing apparatus; 

10 iii) a second protein separation apparatus; 

iv) a solid surface; and 

b) treating said at least one sample with, said first protein separation 
^paratus to generate a first separated polypeptide preparation; 

c) treating said first separated polypeptide preparation with said 

15 second protein separation apparatus to generate a second sq)arated 

polypeptide preparation; and 

d) spotting at least a portion of said second separated polypeptide 
prq)aration onto said solid surface to generate a protein 
microarray. 

20 

2. The method of Claim 1 , wherein said solid surface comprises a 
nitrocellulose slide. 

3 . The method of Claim 1 i further comprising the step of exposing said 
25 protein microairay to a biological fluid. 

4. The method of Claim 3, wherein said biological fluid comprises sera. 

5. The method of Claim 4, wherein said sera is firom a subject having a 
30 disease. 
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6. The method of Claim 5, wherem said disease comprises cancer. 

7. The method of Qaim 3, further comprising the step of detecting antibody 
binding to said protein microairay. 

5 
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FIGURE 1 
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FIGURE 8B 
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FIGURE 9 
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FIGURE 12 
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FIGURE 13 
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FIGURE 14 
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FIGURE 16 
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Figure 17 
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Figure 18 
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Figure 3 1 . A 2-D map of the liquid separations of the LnCAP cells using 
chromatofocusing over a pH range of 4-7 followed by separation in the second dimension 
using NPSRPHPLC. 

Lncap cells (pH 7.0-4.0 with 02 pH interval): 
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Figure 32. The NFS RP HPLC separation of CF fraction #5 at pH 6.0-6.2. 




+— 1 — I — I — I — " — I — • — I — • — I — ' — I 

0 5 10 15 20 25 30 

Retention time (min) 



34/39 



wo 2004/063719 PCTAJS2004/000774 



Figure 33. Concentration profile as a function of pi for the choromatofocusing of ttie 
LnCAP cell line fix)m pH 7.0 to pH 4.0. 
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Figure 34: Detection of humoral response in serum from nprmal and prostate cancer 
patients. UiCAP cells were chromatofocussed at pi 6.0-6.2 and fractionated using non- 
porous silica based reverse phase HPLC. The fractions were denatured and spotted on 
nitrocellulose slides. The slides were probed with serum from prostate cancer patients (A) 
or Normal individuals (B). Specific fractions showed immunoreactivity with prostate 
cancer sera (arrows) that was not seen with seruin from normal individuals. 
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Figure 35. Detection of humoral response in prostate cancer against fraction B6. (A) 
LnCAP cells wae fractionated at pH6.8-7. The fractions were spotted on nitrocellulose- 
coated slides and probed with serum from normal and prostate cancer patients. Specific 
immunoieactivity was observed with fraction B6, when probed with prostate cancer sera 
but not with serum from normal individuals (A). When a total of 34 serum samples (15 
normals and 19 prostate cancer) were used for analysis, 8 out of the 19 prostate cancer 
son showed tiie presence of specific antibody repertoire against firaction B6, giving a 
sensitivity of 42 % (B). However, only 1 out of the 15 normal sera used showed the 
presnce of antibodies against fraction B6, giving a specificity of 93 % (B). 
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Figure 36 Detection of humoral response in prostate cancer against Creatine Kinase 
fractionated at pH 6-62. LnCAP cells were fractionated at pH 6-6.2, The fractions were 
spotted on nitrocellulose- coated slides and probed with serum from normal and prostate 
cancer patients. A total of 24 serum samples (12 normals and 12 prostate cancer) were 
used for analysis. 10 out of the 12 prostate cancer sera showed the presence of specific 
antibody repertoire against Creatine kinase, giving a sensitivity of 83 %. However, only 2 
out of the 12 normal sera used showed the presence of antibodies against creatine kinase. 

Humoral Response in Prostate Cancer to Creatine Kinase (Fraction 54, pi 6-6.2) 
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Figure 37 Detection of humoral response in prostate cancer against 54 kDa nuclear 
RNA-DNA binding protein fractionated at pH 6-6.2. LnCAP cells were fractionated 
at pH 6-6.2. The fractions were spotted on nitrocellulose- coated slides and probed 
with serum from normal and prostate cancer patients. A total of 24 serum samples (12 
normals and 12 prostate cancer) were used for analysis. 10 out of the 12 prostate 
cancer sera showed the presence of specific antibody repertoire against RNA-DNA 
binding protein, giving a sensitivity of 83 %. However, only 5 out of the 12 normal 
sea used showed the presence of antibodies against this piwtcin. 
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